Wu Hao smiled and shook his head and said: "No, it is just an unfinished product. There are still many problems that we need to solve.
For example, in the conversation just now, it was more difficult for it to understand and process the ambiguous context."
"Ambiguity context?"
Zou Xiaodong was stunned for a moment, then quickly understood and said: "This seems to be difficult for us real people to understand, let alone a machine program.
Boss, I don’t quite understand. Most technology companies are currently working on speech recognition and voice dialogue, and the results are pretty good.
The recognition level of these voice software for our normal speech is also very high, basically reaching more than 99%.
However, the response speed of these software is far less than the recognition speed of our technology, the understanding ability is not as strong as it, and the association processing power is not as good as it.
In addition, in terms of voice dialogue, how did you make the machine language so close to the human voice?
You must know that human hearing is still very sensitive, and it can be quickly distinguished whether the sound is a human or a machine program."
Wu Hao heard a lot of questions from Zou Xiaodong and asked him back: "What do you think is the biggest difference between a real voice and an AI voice?"
Zou Xiaodong thought for a moment, and then replied: "Is it missing Pingchidunde?"
Wu Hao shook his head and said: "This is not the most critical thing. In fact, some voice software currently on the market can already perform simple tones and frustrations."
"That's..."
Wu Hao looked at Zou Xiaodong's puzzled look and said with a smile: "Emotions, all voice programs on the market currently lack emotions."
"Emotions, are you kidding me? How can a program have emotions? This is something that only talents have." Zou Xiaodong shook his head and said incomprehension.
Wu Hao smiled, then controlled the computer to display a schematic diagram of the structure on the big screen and said: "It's more about the temperature of language than feelings.
When we speak, the other party can clearly perceive the emotional changes when we speak. This is emotion, and this is also the temperature of language.
As for the language program, it reacts according to fixed formulas. Therefore, it cannot understand the temperature of each sentence, and naturally there is no temperature in generating speech.
What we need to do is to add an understanding of the language and vocabulary environment into the speech recognition process, and analyze the temperature of the speech and the speaker's emotional changes from different tones."
"I still can't understand how the program can capture the ever-changing emotions displayed by people when they speak. You must know that sometimes slight changes in language and tone can express two completely different meanings and two emotions.
How does the machine tell the difference?" Zou Xiaodong expressed his doubts.
Wu Hao smiled while demonstrating the content on the screen and replied to him: "This is the application of AI technology. Everyone's language and intonation are different, and their emotional expressions are also ever-changing. If we follow the traditional method, we need to deal with these ever-changing
The language intonation context is captured, collected and analyzed to define. If this is the case, the workload will be too great.
Therefore, the learning and evolution ability of AI technology allowed me to find an idea. We can train a basic AI voice program by capturing the massive voice information on the Internet.
Of course, this is just a basic program sample, and we need to make corresponding adjustments based on the user's habits. Let the program learn to adapt to the user. The longer the user uses it, the more accurate the recognition and understanding of the AI recognition program will be." Speaking of this, Wu Hao said with a smile: "This is actually very similar to the process of us real people getting along in real society. After two strangers get to know each other, both parties are gradually figuring out and adapting to each other.
The longer the time goes by, the more familiar the two parties become. Even a simple word, gesture or look from one party can be accurately received and understood by the other party. This is the so-called tacit understanding.
What we have to do is to cultivate a tacit understanding between the program and people, but it is difficult for users to change, and they can only have a subtle influence. So we have to start with the program software, let it adapt to the user, and change the use of it subtly.
who.
Only in this way will human-computer interaction become more tacit.
This is also the reason why when I was talking to 10 before, it couldn't understand my ambiguous context. It didn't adapt to my speaking habits, so it didn't understand the meaning of the vague words I said. <
/p>
Like what, how many, how many, then, where, random, these uncertain and fuzzy words are difficult for the program to understand and process. And this requires us to give these words a basic definition. This definition cannot be rigid.
It has to be adapted to the user’s context and modified accordingly.”
After saying this, Wu Hao looked at Zou Xiaodong and said seriously: "Only when the program understands the emotional temperature in our real people's words, can the program simulate a voice similar to real people's speech."
"No matter what, this is a major breakthrough in the field of AI voice technology. I think once this technology is released, it will definitely shock the world. It represents the true arrival of the intelligent voice era.
To be honest, I can't wait." Zou Xiaodong licked his dry lips and said excitedly.
Wu Hao waved his hand and said: "It's not as exaggerated as you said, but it is indeed a major breakthrough in technology."
"Boss, do you plan to directly market this technology to the mass consumer market, or cooperate with enterprise users to sell technology and related patents, or provide services to them in the form of open source." Zou Xiaodong asked him curiously. This is a question.
This heavyweight technology, no matter who it cooperates with, will bring huge shock to the industry.
"What do you think?" Wu Hao did not answer directly, but asked rhetorically.
Zou Xiaodong thought for a while, and then said seriously to Wu Hao: "If a company wants to become bigger and stronger, it cannot just be limited to a single field. Cooperation with companies can certainly save a lot of things, but the risks are very high. Once a company cooperates
If we obtain more advanced technology, we will face the risk of being abandoned.
So I think we should develop the mass market and use this technology to build our brand among the people and expand our influence. Only in this way can we reduce unnecessary troubles and resistance in future development." "The analysis is very accurate, but this market has huge potential. Monopoly alone will definitely not work. We still need to cooperate with those companies. Of course, we cannot lag behind in the mass market.
So I plan to take a two-pronged approach, and this smart voice assistant is specially created by me for the mass market. How about releasing the video I just demonstrated and what kind of reaction do you think the society and the industry will have?" Wu Hao smiled.