42
« on: April 21, 2007, 05:30:16 pm »
Hi
I am going to start working on a voice control system based on speech recognition for linuxmce. I have a basic and clear idea of how I want the system to behave but I am open to suggestions. I am going to explain my idea so you can critice or add to it. Any idea is welcome. Also, if anyone wants to collaborate is welcome as well.
Basically the system will have two ways to interact. One through the menu and the other by direct order.
Menu: Basically you say menu and the menu displays just like if you were using a mouse. Then you can say another option from the menu and that submenu will open, and so on until an action from a menu is chosen. You can go back by saying back.
Direct order: Basically a list of orders will be stored with the sentence that activates it and the action or actions to be done. It should be totally configurable by the user, letting you change any order. Also, it should work with templates, being able to export your actual configuration, download "popular" order templates from the internet, and so on. Once in your system the template should be totally reconfigurable.
There are two issues here. There should be room to let the system "understand" numbers and maybe for the future understand dates and hours. Something like "Turn off the tv in %time" where %time would be a number of minutes or hours.
Also the system should recognize and alert to the user for incompatible orders. For example, you configure the sentence "switch on" to turn on the lights and you configure the sentence "switch on tv" to turn on the tv. That can cause problems because when you say "switch on tv" the system could understand that you said "switch on" and turn on the lights. I dont know if it will be for the first version, but I want the system to alert the user for this.
The area where I have more doubts on how the system should behave is where you have to select a movie or a song from a large list. I am waiting to use more the system to decide how is the best way to do it.
Ok. So basically this is it. If anyone has any idea on how to improve this model or has any experience with a voice control system and wants to give his/her opinion is welcome.
Hugo