OESF Portables Forum

Everything Else => Zaurus - Everything Development => Distros, Development, and Model Specific Forums => Archived Forums => User Request for Applications => Topic started by: wizzard on May 30, 2004, 01:15:34 pm

Title: Speech recognition
Post by: wizzard on May 30, 2004, 01:15:34 pm
Has anyone thought about implementing voice/speech recofnition in the Z? A \'speech to text\' application would be very useful I\'d think, especially when the Z (atleast my 5600) has such excellent voice recording capabilities.
Unfortunately my own programming knowledge = 0
Therefore I am not capable of developing anything for the Z at all, but am willing to be a beta tester.
Anyone interested?
Title: Speech recognition
Post by: alan on May 30, 2004, 01:48:49 pm
Hi!

I have been using such a prog for two monthes. It was \"Dragon Naturally Speaking\" under win xp. It is not that usefull because of so many recognition errors... I finally learned how to use a keaboard...

I have two objections :
1) can a 400mhz/64mo ram be enough? my \"old\" 850 mhz with 128 mo wasn\'t (too long sentences made it \"forget\" some words.)
2) is it really usefull? I mean if your Z can record, why don\'t you play your recorded voice and have it recognizedby a voice recording recognition on your pc?

Cheers.
Title: Speech recognition
Post by: lardman on May 31, 2004, 06:08:23 am
flite generates speech, Sphinx recognises speech.

There are binaires available for both.

Have a play.


Si
Title: Speech recognition
Post by: spaul on May 31, 2004, 12:08:43 pm
lardman, i\'ve been googling but i couldn\'t figure out whether there is a sphinx binary for the z.  sorry if i missed an obvious place, thx
Title: Speech recognition
Post by: wizzard on May 31, 2004, 01:11:53 pm
The instructions for downloading and compiling Sphinx can be found at http://www.speech.cs.cmu.edu/cgi-bin/cmusp...ldAndRunSphinx4 (http://www.speech.cs.cmu.edu/cgi-bin/cmusphinx/twiki/view/Sphinx4/HowToBuildAndRunSphinx4)
Is anyone interested in compiling this for the Z?
I\'d have tried but my programming expertise can comfortably fit in a nutshell with the nut and room to spare.
Title: Speech recognition
Post by: lpotter on May 31, 2004, 03:38:46 pm
yes, there is work being done on something like this. I can\'t say who, or what, or when, if ever, it will get released.
Title: Speech recognition
Post by: wizzard on May 31, 2004, 03:44:29 pm
Quote
yes, there is work being done on something like this

That\'s good to know. If it is ever released, I hope there\'ll be a version for the 5600 (since that\'s the one I have).
Title: Speech recognition
Post by: lardman on June 02, 2004, 09:37:28 am
I\'ve already compiled it (about a year ago in fact).

http://sgp.zaurii.net/binaries/sphinx2-0.4_arm.ipk (http://sgp.zaurii.net/binaries/sphinx2-0.4_arm.ipk)
http://sgp.zaurii.net/binaries/sphinx2-0.4_arm.tar.gz (http://sgp.zaurii.net/binaries/sphinx2-0.4_arm.tar.gz)
http://sgp.zaurii.net/binaries/sphinx3-0.1_arm.ipk (http://sgp.zaurii.net/binaries/sphinx3-0.1_arm.ipk)
http://sgp.zaurii.net/binaries/sphinx3-0.1_arm.tar.gz (http://sgp.zaurii.net/binaries/sphinx3-0.1_arm.tar.gz)

Not sure about the packaging.

Enjoy.


Si
Title: Speech recognition
Post by: BalroG on June 02, 2004, 10:30:01 am
Wow, this sounds cool.

lardman, how well does it work on a C860? I guess you have to wangle some way of getting a mic set up for the Z for this to work though.
Title: Speech recognition
Post by: ads on June 06, 2004, 11:41:22 am
Hey Lard man ,
I\'m playing around with this....thanks for the info

I have had no joy so far....can you give some basic advice to get us started? It would be a great help.

Thanks ads
Title: Speech recognition
Post by: ads on June 06, 2004, 11:42:37 am
Ps  c750 Qtopia cacko rom
ads
Title: Speech recognition
Post by: alan on June 06, 2004, 11:44:53 am
woooooooooow!

I MUST give this a try...

Does it work with pdaxrom?
Title: Speech recognition
Post by: ads on June 06, 2004, 11:57:56 am
As far as I know it has more chance of working with pdaX than anything else but I\'m using Qtopia and so far no joys...  

I really wanna get it going so I can use my zaurus to launch and control programs using only speech....

ie mp3 player, flite for reading books and some database retieval ( address retrieval) without having to pull out my zaurus

Its a pretty distant dream but if I can get the building blocks then I\'ll be happy.

ads
Title: Speech recognition
Post by: lardman on June 06, 2004, 11:59:15 am
@ads: To tell the truth I don\'t know.

If I remember correctly, sphinx is mostly just the libs which allows you to create your own apps which can do speech recognition. That said, there are a couple of example apps in the tarball (I think). I just compiled it as someone had asked me to, I\'m not sure that I ever got round to using it myself, though I think others did. Assuming the devnet is back up (and has its old data) you could try a search there.

I\'ll have a look at it during the week and see whether I can remember (but you\'re probably better off just reading the release notes/README/INSTALL files in the tarball as then you\'ll know pretty much as much as I do).

@alan:

It\'s command line so it ought to.


Si
Title: Speech recognition
Post by: ads on June 06, 2004, 12:03:56 pm
Lardman, thanks for that....I have had a look at the release notes, and it gets kinda involved....I was looking for an easy solution.

I\'ll have a real go at it later when I get the time but it looks like a weekender to get it all going.
Thanks for all your help.

Ads