Corpora: Voice-recognition systems

David Coniam (coniam@cuhk.edu.hk)
Fri, 23 Jan 1998 08:30:31

I've been doing a small study of how accurate these programs are, first
with native speakers (NSs) and now with very competent second-language
speakers. The programs, Dragon and IBM, claim about 95% accuracy.

The hassle at the moment is that you have to train the program to your
voice. So, with Dragon NaturallySpeaking, I got 6 NSs first to train it for
about an hour (about 4,000 words), and then to dictate a sight-unseen text
(about 1,000 words), which I analysed in terms of t-units, clauses, groups
and words.

Results, as you might expect, degrade with the complexity of the unit of
analysis. The mean accuracy figures for each category (for NSs) are:

t-units   clauses   groups   words
64.6%     68.7%     78.1%    80.9%
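
To illustrate why accuracy drops as the unit gets larger, here is a rough
Python sketch. It assumes (this is not stated in the post) that a unit
counts as correct only if every word in it was recognised correctly. The
function name, the 12-word text and the unit boundaries are all made up
for illustration; none of it is data from the study.

```python
def unit_accuracy(units, word_ok):
    """Fraction of units in which every word was recognised correctly.

    units   -- list of units, each a list of word positions (hypothetical)
    word_ok -- list of booleans, one per word, True if recognised correctly
    Assumption: one misrecognised word makes the whole unit wrong.
    """
    correct = sum(1 for unit in units if all(word_ok[i] for i in unit))
    return correct / len(units)


# Toy 12-word text with a single misrecognised word (position 3).
word_ok = [True] * 12
word_ok[3] = False

# Invented segmentations of the same 12 words into larger and larger units.
words = [[i] for i in range(12)]
groups = [list(range(i, i + 2)) for i in range(0, 12, 2)]
clauses = [list(range(i, i + 4)) for i in range(0, 12, 4)]
t_units = [list(range(i, i + 6)) for i in range(0, 12, 6)]

for name, units in [("t-units", t_units), ("clauses", clauses),
                    ("groups", groups), ("words", words)]:
    print(f"{name:8s} {unit_accuracy(units, word_ok):.1%}")
```

Under that assumption a single wrong word costs one word out of twelve but
one t-unit out of two, so the larger the unit, the lower the score.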

The paper's coming out later this year in TEXT Technology.

Dave Coniam
Chinese U of HK
