Chapter 54 Actual shooting and testing
"This is so fucking impossible..." Yu Kai still couldn't understand this terrifying speed increase, "You must be using your newly proposed Dream here, right?"
"Yes, the residual network I proposed is very helpful in improving performance, and this help is universal. It is not only for classification tasks, but also for detection, segmentation and other types of tasks.
Powerful performance improvements."
"But if you want to achieve this running speed, you can't actually use the 50 or 100 layers mentioned in your paper, right?"
"Of course, for fast detection algorithms, there is no need to use too deep a network structure. An 18-layer or 34-layer version is enough to meet most needs."
"No, this is impossible." Yu Kai took out a copy of the Dream paper brought back by Robin Li and carefully checked the parameters of Dream's 18th and 34th layers.
"For a network with this amount of parameters, you can only produce 3-5 pictures per second at most." Yu Kai calculated over and over again, but still couldn't get the result right.
Meng Fanqi originally wanted to explain that the huge speed increase came from innovative breakthroughs on the detection side, not the backbone network.
The YOLO method does not use sliding windows or propose optional areas, but directly performs regression on the entire image.
The generalization performance of this approach is very good, and the performance does not fluctuate much for different types of pictures in different scenes, but it is slightly lacking in more subtle things, such as the detection and absolute position of smaller objects.
But as soon as I wanted to speak, I felt it was not safe.
The two technical leaders in front of me are both masters among masters. If they talk too much, they will make mistakes. If they wake up others, it will be a big problem.
"The specific results have been shown to a few of you. The details and principles of the algorithm are definitely not convenient for us to discuss in detail at this stage." Meng Fanqi responded with a smile, "If you calculate that it is impossible, it means that your premise is correct.
Wrong."
"For me personally, there is actually a lot of room for improvement in this score, but my current main interest is not in this direction."
Listen, is this human language?
Wang Haifeng was about to say something, but he was speechless and choked on the spot.
He has developed a lot of detection algorithms in the past two years, and he is still far away from this result. What is the result of what the person in front of him said?
Not that interested in this direction at the moment? So how did you improve the detection accuracy while also speeding it up by more than a hundred times?
You wrote a little casually, right? Are you angry?
"Will you be responsible for continuing to optimize this series of algorithms in the future?" Robin Li is very concerned about this matter. If Meng Fanqi agrees to continue to optimize and upgrade this series, it will actually have an effect similar to recruitment.
"Then it depends on how we signed the contract." Meng Fanqi started to practice Tai Chi. He didn't see the contract, so it's naturally difficult to say this kind of thing.
Li Yanhong leaned back on the chair, held his chin with his left hand, and began to think.
Meng Fanqi does not doubt Li Yanhong’s investment in AI. In the ten years from 2013 to 2023, Li Yanhong has invested more than 100 billion in AI, averaging tens of billions every year.
Even if one percent is allocated from the annual budget, it will be enough for oneself to have enough to eat.
"We also have some picture data here, can we move it over for some reasoning?" Wang Haifeng asked.
"No problem, everything is fine." Meng Fanqi suddenly became vigilant after hearing this. Generally speaking, it sounds normal to require the algorithm to make inferences directly on its own data, but in fact it is not very reasonable.<
/p>
If the data is different, the type in the picture may be completely different, so it cannot be detected.
Corresponding training data is needed to fine-tune the model to be more reasonable.
Combined with the questioning attitudes of the previous two technicians, Meng Fanqi began to wonder if the reason why he was suddenly called here today was because someone simply did not believe his results.
Although I feel a little unhappy, it’s understandable.
Meng Fanqi took out an external camera directly from his bag, "Or you can just connect a camera directly. We won't spend that effort to move the data."
There is a certain risk in directly plugging a USB flash drive into a computer, which is why many major manufacturers later did not allow employees or other personnel to connect any external devices to their hosts.
Meng Fanqi's response became cautious after repeated questions from Baidu's two technical staff.
The previous communication with Robin Li was so smooth that my previous mentality was a bit ridiculous. I still need to be more cautious when dealing with such a major transaction.
"Have you done relevant tests in advance and added interfaces?" Yu Kai felt completely unsure.
Connecting an external camera is the most direct and crude way. Everyone can see the effect of the detection algorithm on the content captured by the camera in real time.
It is almost impossible to fake this thing.
Just now, my eyes indicated that Wang Haifeng proposed to use Baidu's own data for testing. In fact, the subtext was that Meng Fanqi may have used this part of the test data to fine-tune his own model in advance.
To put it bluntly, it is cheating, allowing the model to first learn the data that will be used for testing. After reading the reference answers and then answering the questions, the score will naturally increase by leaps and bounds.
And connecting an external camera to take real-life measurements is equivalent to a third-party examiner giving the questions on the spot, and there is no chance of cheating at all.
Since he had already done testing and adaptation before, it didn’t take long for Meng Fanqi to connect the camera and start running his own algorithm.
The three senior executives pointed the camera at Baidu. The image on the computer screen was quickly framed by an algorithm to select the positions and categories of people, tables, chairs, computers and other elements.
Meng Fanqi deliberately shook the camera, and all the selection boxes were almost close to the target object, following smoothly. There was no such thing as the biggest drawback of current detection algorithms, which is that the detection boxes cannot catch up with people.
Meng Fanqi held up the camera and took pictures everywhere, and found no problems with the recognition of common objects, such as books and water cups.
At this moment, even if they can't think clearly anymore, the two technical leaders are still materialistic and believe in science.
Yu Kai took a deep breath, "This is a very terrifying breakthrough..."
"I wonder what the value of this 'terrible breakthrough' is from the management perspective of an Internet giant?"
To be honest, Meng Fanqi really doesn’t know about this matter. He knows the details of these technologies and the strength of the breakthroughs.
Chapter completed!