Transcript: Dr Kai Fu Lee in Yale-NUS Book Launch AI Superpowers


Sinovation Ventures' Dr. Kai-Fu Lee talking about AI Superpowers

Posted by Tech News Online on Wednesday, November 21, 2018


[Synthesized Speech in the likeness of Pres Donald Trump]


Dr. Kai Fu Lee: So this slide shows you the power of deep learning because it is built on deep learning technologies and it could re-synthesize speech like President Trump and that was not him speaking, it was by the synthesis system.


And deep learning is a big technology breakthrough that can do the following in a single domain: it works in one domain only, with a huge amount of training, and reaches superhuman performance.


So when trained on Amazon clicks, it learns how to make money, the most money from people entering the sites. When trained on large amounts of President Trump’s speech, it talks like him. When trained on CT scans of lung cancer and non-cancer, it learns to distinguish them from each other. And when trained on Go games, it learns to play the game Go with human capability. So, that has been the most amazing breakthrough in the past 63 years of AI history.


And in terms of applications, it actually covers many fields. Many people get confused and think only robots and autonomous vehicles are AI. But actually the same deep learning algorithm goes through these four waves of artificial intelligence. Let me start from the first wave. It’s the Internet wave.


Naturally, (unintelligible) in English we collect the most data and it’s automatically labeled. We are guinea pigs labeling (candidate) for Facebook, Amazon, and Google: when we use them and click, it learns what you like and don’t like.


And then – and those what I’m going to show you and that’s how the websites are no better than usual but things you want to click on. [Background noise] And today has so much revenue.


What’s more, each of the implementation of the deployment for internet websites comes within objective function maximizer. So, Amazon can choose to maximize revenue or profits. And Facebook can choose to maximize virality or minutes per user.
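The idea of an objective function maximizer can be sketched in a few lines of Python. This is only an illustration: the `Candidate` fields, the item names, and the numbers are hypothetical and do not come from the talk or from any real company's system. The same candidates are ranked differently depending on which objective is plugged in.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """One thing the site could show; all fields are hypothetical predictions."""
    name: str
    expected_revenue: float  # assumed predicted revenue if shown
    expected_minutes: float  # assumed predicted minutes of engagement if shown


def pick(candidates, objective):
    """Return the candidate that maximizes the chosen objective function."""
    return max(candidates, key=objective)


# Invented example data, purely for illustration.
candidates = [
    Candidate("item_a", expected_revenue=1.20, expected_minutes=0.5),
    Candidate("item_b", expected_revenue=0.40, expected_minutes=3.0),
]

# Same model outputs, different business objective, different choice.
by_revenue = pick(candidates, lambda c: c.expected_revenue)
by_minutes = pick(candidates, lambda c: c.expected_minutes)
```

Swapping the lambda is the whole "choice" described above: the ranking code stays the same and only the objective changes.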


So, imagine the job of the CEO suddenly got a lot easier. And that’s also why these internet companies are near (shooting down) their companies, just because they have a lot of training data and (financial) (that helps them) make more money.


Now, second wave. It’s applicable equally to businesses that have lots of data – to banks, insurance companies, hospitals, government branches, so on and so forth. Anyone who’s had a data repository can now use it to apply AI to it.


Take as an example (unintelligible) bank. Whenever I meet with a bank CEO, there’s always a fourth person sitting in the back of the room looking at the record. That is the head of the data center, because that person generates no profits, no revenue. It’s merely a cost center storing data for archival purposes.


But guess what, that data center and that person have just turned into a mountain of gold. Because with all the customer transactions and history, a bank can now optimize customer asset allocation, can help make more money, can help minimize default, can help detect credit fraud. So that is phenomenal, and similarly for insurance companies and so on.


But before you start thinking it is only to be used by traditional companies, let me show you an example where it can be used to disrupt companies.


We found that AI loan application and it’s (an app every time) [Audio Break] and when you download it, you can fill in all the blanks such as your income, your name, your workplace, and so on, about 10 fields. But also, you have to upload your phone data to the loan application decision maker.


And that data isn’t everything on your phone, obviously. It’s only things permitted by iOS and Android. It’s the same things that Facebook and Twitter take; nothing more and nothing less.


And what the system will do is decide whether to give you a loan based on what you entered and what you uploaded. So think about it. If you were to walk outside the National University and run into 3000 strangers, could you pick 1000 of them and loan each SG$500? There goes (SG$500,000). What percentage of default rate might you get? Very high, right? Maybe 80%, 90%. Okay, Singapore maybe 60%. [Laughter] But still a high number.


So, guess what: the default rate for this loan company is 3%. So, how does it manage to do that? Because it has so much information from the phone that you regard as useless. It’s actually little bits of valuable information.
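To put the figures in the loan example side by side, here is a small worked calculation in Python. It uses only the numbers quoted in the talk (1000 loans of SG$500, a guessed 60% default rate, and the app's 3% default rate) and makes two simplifying assumptions that are not from the talk: a default loses the whole principal, and interest is ignored.

```python
def principal_lost(num_loans, loan_amount, default_rate_pct):
    """Principal lost, assuming every default loses the full loan amount.

    Simplification for illustration only: interest and partial
    repayment are ignored.
    """
    return num_loans * loan_amount * default_rate_pct / 100


NUM_LOANS = 1000      # strangers picked in the example
LOAN_AMOUNT = 500     # SG$ lent to each

total_lent = NUM_LOANS * LOAN_AMOUNT                         # SG$ lent in total
loss_if_guessing = principal_lost(NUM_LOANS, LOAN_AMOUNT, 60)  # at the 60% guess
loss_with_app = principal_lost(NUM_LOANS, LOAN_AMOUNT, 3)      # at the app's 3%

print(f"Total lent:       SG${total_lent:,}")
print(f"Lost at 60%:      SG${loss_if_guessing:,.0f}")
print(f"Lost at 3%:       SG${loss_with_app:,.0f}")
```

Under these assumptions the 60% guess loses SG$300,000 of the SG$500,000 lent, while a 3% default rate loses SG$15,000.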


Together, they are correlated with each other to make a very smart decision. This obviously includes all (unintelligible) typed but also how fast you typed it; that makes the difference because if you’re faster you’re probably copying and type slower.


It also has information like what date of the month it is. Is it before payday or after payday? Before payday, it’s a good loan. After payday, not so good, because you just got paid, so why do you need money?


We would also have the information on (what else do you have). If you have games, you have gambling apps, you have illegal apps, or perhaps you have all serious knowledge apps. That makes a difference on your loan. The model of your phone, that makes a big difference – very big difference.


And also, once you come (back to us), I’m going to keep (right in it). Are they real people? Is the person you called “dad” really your dad? So on and so forth. These are all things that can be found merely using the data you upload and it’s available on the internet.


So, just out of curiosity we asked how many such features are there. There were 3000. And just out of curiosity we asked, what’s the least important feature? It turns out to be your battery level.


So, why does that matter at all? Well, if you’re someone who plugs in your phone all the time, you’re probably a little bit correlated with someone who returns loans. If you’re someone who lets your phone run out of battery, you’re probably a little bit correlated with someone who defaults.


So obviously, that is not an important factor. That might be 1 billionth of all the information that’s still there, but all of it is considered. And in aggregate it’s something (worth looking into).
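One way to picture many weak signals being considered "in aggregate" is a logistic score, where each feature nudges the result by a small weight. The talk does not say what kind of model the loan company uses, so the sketch below is an assumption, and every feature name, weight, and bias in it is invented for illustration.

```python
import math


def default_probability(features, weights, bias):
    """Logistic score: add up many small weighted signals, squash into (0, 1)."""
    z = bias + sum(weights[name] * value for name, value in features.items())
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical weights; positive values push toward "likely to default".
weights = {
    "applied_before_payday": -0.30,
    "has_gambling_apps": 0.40,
    "battery_often_empty": 0.05,  # deliberately tiny, like the weakest feature
}
bias = -3.0  # hypothetical baseline

applicant_a = {"applied_before_payday": 1, "has_gambling_apps": 0, "battery_often_empty": 0}
applicant_b = {"applied_before_payday": 0, "has_gambling_apps": 1, "battery_often_empty": 1}

p_a = default_probability(applicant_a, weights, bias)
p_b = default_probability(applicant_b, weights, bias)
print(f"applicant_a: {p_a:.3f}  applicant_b: {p_b:.3f}")
```

No single feature decides the outcome here; the battery signal alone barely moves the score, but it still contributes to the total.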


Moving on to the third wave, that’s when machines start to see and hear and sense with things like cameras, microphones, e-sensors, movement sensors, [background noise] maybe reconstruction devices (unintelligible). All of that gathers data that before was transmitted, non-existent and now can be used to make smart decisions.


I’ll give you an example. You all know, (that video that he suddenly) (unintelligible) the police mentioned. [Laughter] Do you know why? Because in the last concert he gave, over 20 people were arrested from the most wanted list. Because the stadiums were – they were – they had cameras installed. And the cameras were connected to face recognition to the criminal database.


And as a result, the police apprehended some number of people, and said to some of them, “Oops, sorry. Your ID looks good. My mistake. Please enjoy the concert.” But the unfortunate 20-something people who were actual criminals, who (thought) for one night they could sneak out from wherever they were hiding to enjoy a concert – they were caught.


This talk is not about whether that’s a great act or whether that worries people; it’s about the power of face recognition. No human could possibly do this. Even if you took the 1000 best policemen in China, they couldn’t possibly identify 25 criminals out of a field of 60,000 people in the concert, because we simply don’t have the memory or the face recognition capability. And we get tired; fatigue overcomes us. We can’t possibly do that.


So, hopefully the loan example and the example at that concert will show you that AI is far surpassing our capabilities, if you satisfy the requirements: one domain, lots of data.


Now, in the third wave we will also have autonomous stores that recognize your face, your movement, your gesture, your intent: that you pick up this bottle of water and maybe you bought it, you put it in the basket and it charges you, or maybe you look at it with disgust and put it back. That indicates something. It turns an offline store into one with the same power as Amazon or online. What do you think from it? That just put there then security comes up. That’s it. Don’t drink from my water.


So, all these things are the kind of (unintelligible) in wave three and it will automate a lot of things with smart vision and hearing.


Wave four is robotics and autonomous vehicles. Most people know a lot about those already. But I can tell you, robots are actually difficult to do everything humans can do. I’m on the board of Foxconn. And I can tell you, it’s not going to be easy displacing people who make iPhones, those that require the level of dexterity and hand-eye coordination.


However, there are many jobs that are stationary and repetitive that can be displaced by robots. For example, watering plants selectively with just the right amount of water and fertilizer based on the growth as observed by computer vision, or picking fruits, doing dishes, doing inspection on assembly. So we’re still talking about tens of millions of jobs, but just not very high dexterity jobs.


And the autonomous vehicle is of course the biggest of all breakthroughs that will change the way we transport ourselves. Logistics, delivery: it will make life so much more efficient and convenient. It will make the air safer; there will be fewer fuel cars on the road. You will no longer have to buy cars, and it will be a lot safer – especially over time. Because one very important thing is more data makes AI possible, and more, more, and more data makes AI better.


So the moment that autonomous vehicles are launched, hopefully, they are pretty safe. And then five years later, they become extremely safe. Five more years, even safer. So it keeps getting better and better. So that’s the fourth wave.


Each of these waves represents something like a 5% increment to, say, the GDP. Each also represents some 5% of jobs displaced. This slide shows you the key things that make AI work: massive data, very good labeling, and a single domain. And usually, you need a lot of compute power and some AI experts.


So who has the AI experts? Obviously, America – more specifically, America with some British and Canadians are by far the leaders in the world. So why do I say that China has a chance? Because there are three very important observations to make.


The first is there are very few breakthroughs. People generally assume there are a lot of breakthroughs because you read about them in the paper. But actually, all the breakthroughs that you read about in the paper are built on deep learning or similar technologies.


And deep learning is fairly well understood, nor can we expect the US to continuously come up with another big breakthrough because, after all, in the last 62 years, there was only one breakthrough. Why would we think there will be five more in the coming years?


Secondly, because the technologies are well understood, we’ve now moved beyond technology, discovery, and disruption. We’re moving to taking the mature technology and applying it. Just like the early days of the internet. The discovery of TCP/IP, amazingly important. The building of the web browser, amazingly important. The invention of electricity, amazingly important. But there never was a TCP/IP 2.0 that disrupted everything. There was a 2.0 but it was a small net. There never was an electricity 2.0 that destroyed everything. It was the 1.0, when the groundwork was done, and then all the applications were built on top of TCP/IP, the browser, electricity. So I like to say that deep learning is like these things. So we’re in the era of implementation.


Thirdly, AI is a very open domain. All that is known is largely in open source. If you want to learn AI, you can take courses, and with all of the open source out there you can build what you want to build. So, companies compete on implementation and ideas, and on how quickly they can make money, not on the breakthroughs. Because the breakthroughs are done, we understand them. They’re in the open space.


Now, how does China build on these three things? Well first, China has a lot of great AI researchers and engineers. They are not as similar as the American ones, but in this chart you see that of all the AI papers 42% have Chinese authors, and 40% – 2% of all the authors are Chinese.


And the Chinese can innovate in these products. You see, the Chinese used to be copycats, but the Chinese became equal to the US in the in-between stage and have now leapfrogged to build $300 billion of value in the orange slice that you see.


And the Chinese entrepreneurs, they are hungry. They’re good at finding business opportunities. They work hard. They build barriers because they’re surrounded by copycats. They have to build products that are uncopiable. The only uncopiable product is one that has a moat built around it. That’s also hard to copy.


So, for example, Meituan’s 600,000 delivery people and the infrastructure and vehicles cost billions of dollars. For example, Didi going into buying these vehicles, leasing them, insurance, gas stations: that locks up the domain. So that’s the Chinese method of competition. It’s a very good fit with AI. They’re built up over time. They’re capital intensive. They use a lot of money and then they build a moat that’s very hard to imitate.


The fourth reason is China has a lot of money. A lot of money flow to China. AI – this is not government money. This is private money. And a lot of this money goes into funding Chinese AI companies. Chinese AI funding exceeded US in the last year. [Cough]


And as an example, these are five of our unicorns. So these are AI companies we invested in that have become worth over $1 billion in market capitalization, and the total value is $21 billion. And the newest of these companies was founded only two years ago. These are concepts that I think are unheard of or perhaps not even believed in Singapore. But now you’ve heard it, so you should believe it.


The fifth reason is the power of massive data. The right chart shows you, the more data, the better it performs. In fact, in AI we have a saying called “There’s no data like more data”. Anybody care to guess who said that? A gentleman named, Dr. Bob Mercer, the founder of Cambridge Analytica. [Laughter] A very famous esteemed AI researcher who turned into additional loss. [Laughter]


And so in the age of AI, if data is the new oil [Cough], then China is the new OPEC. China has not only more people but also more usage. Chinese people use takeout more because in China you can get food delivered to you from 500 restaurants in 30 minutes, costing US$0.70 per delivery, and that is the amazing thing that causes Chinese people to have more depth in usage, and that’s where the data comes from.


A lot of people in the West assume that Chinese people just don’t care about data, companies’ trade data. Government is always aware. [Cough]. That’s not true. The companies behave much less [background noise] (than Western do things). But it’s just that there are more people and they use the data more.


In particular, I want to point out the use of mobile data is particularly important because mobile data is the most valuable data. It is – you are paying for something. It’s not just “put it on the page” but you are paying something, and in it you want something and that can be used as a rocket fuel to learn a lot of great AI. [Cough]


And finally the Chinese government [cough] strongly supports AI. [Cough] And Chinese policies tend to be techno-utilitarian, which means try it out and then regulate it only if issues occur. So mobile payment may have (been stopped) in the US because credit card companies may raise the issue that software companies can be hacked, or can commit fraud, or can’t be trusted with managing your money. But China will trust Alibaba and Tencent as long as they live up to their worth, and they were proven trustworthy. So they’ve taken over the credit card space.


And also China has an AI plan on the left side wanting to be the global best by 2030. And then with that plan, each enterprise in each city may come up with specific plan.


So, for example, the state-owned banks: once the government said AI is important, they might procure some AI software. And the city of Nanjing said our schools are very good, let’s build the world’s largest AI science park. And China is happy to build a new city called “Xiongan” which has autonomous vehicles built in, with a top layer for pedestrians and a bottom layer for cars, thereby avoiding the kind of accidents we saw in Phoenix with Uber autonomous.


So as a result, we anticipate that China will catch up with the US somewhere between now and the next five years. And the most important message to take away is that China and the US will be by far the co-leaders in AI.


Who exactly will be ahead really depends on a lot of things. This shows the projection that China is slightly ahead, but only in implementation. The US is currently ahead in research. So new technologies, if invented, could put the US back in the lead. But what is clear is that in this race, there are not three medals like the Olympics. There are only two medals. And they belong to the US and China. Who gets the gold remains to be seen, but there is no bronze medal.


AI will create a huge amount of value, about $16 trillion net additional GDP, but it will also bring a lot of challenges. And due to the interest of time, I’m just going to cover one issue which is job displacement. That is with AI being able to do so many jobs, are all our jobs going to be taken away? Well, it is not.


If you think about what AI cannot do, there are two sorts of things. One is creative things and the other is things that require empathy, compassion, people-to-people connection. So these three attributes separate all the jobs that we do, and we will find that on the lower left, all the jobs will be taken by AI. And that’s of concern and we need to do something about that.


But the jobs on the lower right are a perfect example of human-AI symbiosis, with AI tools helping scientists find more cures for cancer. On the upper left, we will find that AI can be the analytical core, while the human provides the warmth. For example, in the case of a physician, AI can do the diagnosis, then the physician connects with the patient, gets the patient to tell all the problems, enters them in the AI engine, and provides the comfort and confidence, thereby maximizing the likelihood of recuperation but also making the cost of healthcare much lower.


And then on the upper right side is where humans will excel in both compassionate as well as creative skill sets. So we do have something to worry about in the lower left, but we also have a lot to celebrate in the other three components.


But the most important thing, I think, is to look further out into the future. I think your children, for those of you who are students, and your grandchildren, for those of you who are teachers, will probably enjoy an amazing life. Because by the time they feel the effect of AI, they will only see that AI has liberated us from doing routine jobs, allowing us to have a lot more free time to love the people we love, to do the things we’re passionate about, and to have time to think about what it means to be human.


And for those of you who are a little fearful of AI, remember it is just a tool. We’re the only ones who have free will. We will control the AI tools and we get to write the ending to the AI story. Thank you.



Transcript: Mark Zuckerberg at Mobile World Congress 2014

Time Code: 0:00:08.3
[Theme music]
David Kirkpatrick: Hello. I’m David Kirkpatrick. I run something called “Techonomy Conference” which is a place where technology leaders come together with business and government leaders to talk about how technology is changing everything. Conference happens in November, south of San Francisco.
Now, I met Mark Zuckerberg in 2006 when he was 22 years old and Facebook had nine million users. I was so impressed with him from that very first meeting and impressed with his long term vision and the scope of his thinking that I ended up writing a book called “The Facebook Effect” which by the way is published in both Spanish and Catalan.
And now Facebook has 1.2 billion users. So from September of 2006, 9 million, today, 1.2 billion. It’s the largest communication service of any type that’s ever existed.
There’s one movie that portrays Mark as an anxious, angry and vindictive person. But that is not the way I have ever found him. In fact it’s actually his sincerity and his earnestness that most impress me. He thinks a lot about how his company is changing the world for the better. And I think, when you hear him talk you’re going to understand what I mean.
So, Mark, please come out and join me.
Okay, Mark. So, clearly there’s one topic that we have to start with. It’s been on everybody’s lips for the last week or so. You bought WhatsApp for $19 billion which all of us, once we got over, you know, our shock at that, you know, some of us feel like we understand it. But tell us here at the Mobile World Congress, which is really the world’s major gathering of mobile communications – which is an industry that WhatsApp is a big part of, why did you do it and what does it mean?
Mark Zuckerberg: Well, WhatsApp is a great company and it’s, it’s a great fit for us. Already almost half a billion people love using WhatsApp for messaging. And it’s the most engaging app that we’ve ever seen exist on mobile by far – that’s 70% of people who use WhatsApp use it every day – which kind of blows away everything else that’s out there.
What we see is that WhatsApp is on a path to connecting more than a billion people. And there are very few services in the world that can reach that level. And they’re all incredibly valuable. So, when we have the opportunity to be a part of his journey, I was just really excited to take you up on that and to help him realize his dream of connecting a lot more people.


Futuregen is a TEDxXavierSchool Event Sponsor

Futuregen is honored to be a sponsor to TEDxXavierSchool by providing the transcription for the talks. The inaugural TEDxXavierSchool will take place on the Xavier School campus on the morning of Saturday, February 18, 2012.

Described as an “intellectual circus,” the event will bring people together for presentations by global thought leaders, focusing on the theme “Innovation built on tradition.” Kicking off at 8am, TEDxXavierSchool “Innovation built on tradition” will be a half-day event to explore ways in which we innovate based on our history and our own experiences.

Interested attendees must secure a free ticket to participate in the limited-space TEDxXavierSchool event. Ticket applications are available at and must be received by 11:59pm on Friday, February 17, 2012 – or the day before the event begins – to be considered for admission. All ticket types, regardless of category, are FREE of charge.

Innovators scheduled to speak in person at TEDxXavierSchool include:
• Raynard Raphael Lao – a Xavier High School student, who is also a champion public speaker at both local and regional competitions
• Brian Maraña – International Programs Coordinator of Xavier School who has transformed the way students learn from the world
• Tony Meloto – Founder of Gawad Kalinga, providing countless homes to the homeless and building them into communities, and speaker at the World Economic Forum
• Dodie Ng – Games and apps creator who also founded a robotics organization and team for the youth, while also being a Xavier High School student
• Mark Ruiz – Co-Founder of Hapinoy and Founder of Rags2Riches, providing social business enterprise and microenterprise development as a living means to some of the poorest people
• Brian Tenorio – Internationally-acclaimed, New York-based designer who has altered the way development is done, through DesignRegular.

Updates may be accessed through the event’s Facebook page at TEDxXavierSchool , or through Twitter via the #TEDxXS hashtag

TEDxXavierSchool Transcript

Transcription Services in Singapore

Yes, we do audio-to-text services in Singapore. These transcription services help businesses and legal firms convert their audio materials from annual general meetings, group discussions, research interviews, legal depositions, et al. into Microsoft Word document format. This is how the transcription service process works.

Companies have found several benefits to getting transcription services for their audio files:

One is to transcribe files to increase web traffic.
Another is to use transcription firms to produce captions/subtitles that can be read by a new market segment.
Finally, new US laws are in effect which require the use of transcription services:

  • 21st Century Communications and Video Accessibility Act (2010)
  • Workforce Rehabilitation Act Section 504 and 508 (1998)
  • Americans with Disabilities Act (1990)

In summary, these laws require companies and government agencies to ensure that their content is accessible to employees and the public – especially those with disabilities that prevent them from accessing audio content. Our repeat clients appreciate our fast turnaround times, high accuracy (at least 98.5%) and particular skills transcribing non-native English speakers. We also adhere to strict Singapore laws on PDPA (Personal Data Protection Act) with our own PDPA compliance officer.



Transcription Service in Singapore: Sharing the Skills

Futuregen is now working closely with clients of SPD (Society for the Physically Disabled) to train vision-impaired people in the skills needed to transcribe audio into Microsoft Word documents.

While the transcription service training project may seem too ambitious for the visually impaired, they do have a higher level of sensitivity to sound – their listening skills are much better than those of sighted people. Hence when SPD contacted Futuregen to share its skills in transcription service, we readily agreed to give it a try.

The initial results are heartwarming. Some of SPD’s clients were able to transcribe the audio with 94% accuracy. This is not bad for a first try where usual participants would get 70-80% accuracy on their first try.
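The post does not say how the accuracy figures are measured. One common approach, shown here only as an assumed sketch, is word-level accuracy: count the word insertions, deletions, and substitutions needed to turn the transcript into a trusted reference, and divide by the length of the reference. The function names and the sample sentences below are illustrative.

```python
def word_edit_distance(reference, hypothesis):
    """Minimum number of word insertions, deletions, and substitutions
    needed to turn the hypothesis into the reference (Levenshtein distance)."""
    ref, hyp = reference.split(), hypothesis.split()
    prev = list(range(len(hyp) + 1))  # distances against an empty reference
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # word missing from hypothesis
                            curr[j - 1] + 1,      # extra word in hypothesis
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_accuracy(reference, hypothesis):
    """1 minus the word error rate; can go below 0 if there are many extra words."""
    ref_len = len(reference.split())
    if ref_len == 0:
        raise ValueError("reference transcript must not be empty")
    return 1.0 - word_edit_distance(reference, hypothesis) / ref_len


reference = "the quick brown fox jumps"
transcript = "the quick brown box jumps"
print(f"{word_accuracy(reference, transcript):.0%}")
```

In this toy case one word out of five is wrong, so the score is 80%. Real scoring would usually also normalize case and punctuation first, which this sketch leaves out.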

In case you are wondering, SPD’s clients who were visually impaired used software called JAWS to navigate around their PC without the aid of sight. Our plan is to engage SPD’s clients in offering transcription service in Singapore.

Movie Subtitling: Enemies of the People

Our transcription team is proud to have done the transcripts for the subtitles for “Enemies of the People”, a movie on the Killing Fields that is now showing in the US. We feel proud to have been part of this video transcription project.

Please visit the Enemies of the People homepage.

How to record Minutes of Meetings

The accuracy of audio transcripts depends in large part on the quality of the audio recordings. Some common challenges we see with digital recordings of meetings are as follows:

1. Noisy environments with background sounds.
Of course, the obvious solution is to move the meeting elsewhere. A very public and loud place isn’t the ideal location for meetings anyway. However, if that is not an option, then consider reducing the background noise with the use of high quality microphones like the Behringer C1-U. Another option is to digitally enhance the recording, using computer software to minimize the noise and amplify weak audio levels.

2. Several people speaking at the same time.
Consider switching from a single digital recorder to one that is computer based. This allows you to set up multiple microphones. When placed strategically, they let you save the conversation from different channels (mics) into separate audio files.

3. Never use voice-activated mode.
To conserve recording capacity, most audio recorders have a “record when voice is present” (voice-activated) mode. While this does produce recordings that contain less dead air, it also has the unfortunate side effect of dropping words.

4. Bilingual speakers.
Be conscious of bilingual speakers who drift from English to another language. Chairpersons would be wise to restate the speakers’ non-English comments in English and confirm their correctness.

5. Some unusual sources of noise include shuffling papers, coffee cups, dinner plates and cellphones.
Eliminate or minimize the impact of these sources by banning them altogether or, if this is not an option, placing the microphones away from such sources.

Check out the related topic on how to create minutes of a meeting.

We provide audio recording facilities for conferences and meetings using our multi-channel digital system. Email marketing[at] for details.

Punctuation Rules for Comma Use

Rule 1. To avoid confusion, use commas to separate words and word groups with a series of three or more.
Example: My $10 million estate is to be split among my husband, daughter, son, and nephew. Omitting the comma after son would indicate that the son and nephew would have to split one-third of the estate.
Rule 2. Use a comma to separate two adjectives when the word and can be inserted between them.
Examples: He is a strong, healthy man.
We stayed at an expensive summer resort. You would not say expensive and summer resort, so no comma.
Rule 3. Use a comma when an -ly adjective is used with other adjectives.
NOTE: To test whether an -ly word is an adjective, see if it can be used alone with the noun. If it can, use the comma.
Examples: Felix was a lonely, young boy.
I get headaches in brightly lit rooms. Brightly is not an adjective because it cannot be used alone with rooms; therefore, no comma is used between brightly and lit.
Rule 4. Use commas before or surrounding the name or title of a person directly addressed.
Examples: Will you, Aisha, do that assignment for me?
Yes, Doctor, I will.
NOTE: Capitalize a title when directly addressing someone.


English Possessive Determiners

From: McGraw Hill’s 2010 GRE :
English possessive determiners (my, our, your, his/her/its, their – sometimes called possessive adjectives) must match the person and number of the possessor and not the noun phrase to which they are linked.

Richard likes his hot dogs with lots of relish. The word his is third person singular to match with Richard, NOT third person plural their to match with hot dogs.