audio: How do you really convert sounds such as voice to text? (the process itself)

mercredi 21 janvier 2015

Ofcourse, we have APIs available that we can use.

But if you are not going to use an API, can we just attribute a sound to a specific text?

And how can you actually determine that sound is the same with the sound uttered by the user?

What does the computer hear or read when it get sounds? How is it translated to computer language?

audio