Learning with Big Data: The Future of Education

The following is an excerpt from the book.

Learning with Big Data: The Future of Education

Viktor Mayer-Schönberger & Kenneth Cukier

60 pages, Houghton Mifflin Harcourt, 2014

Buy the book »

Luis von Ahn looks like your typical American college student, and acts like one too. He likes to play video games. He speeds around in a blue sports car. And like a modern-day Tom Sawyer, he likes to get others to do his work for him. But looks are deceiving. In fact, von Ahn is one of the world’s most distinguished computer science professors. And he’s put about a billion people to work.

A decade ago, as a 22-year-old grad student, von Ahn helped create something called CAPTCHAs—squiggly text that people have to type into websites in order to sign up for things like free email. Doing so proves that they are humans and not spambots. An upgraded version (called reCAPTCHA) that von Ahn sold to Google had people type distorted text that wasn’t just invented for the purpose, but came from Google’s book-scanning project, which a computer couldn’t decipher. It was a beautiful way to serve two goals with a single piece of data: register for things online, and decrypt words at the same time. Since then, von Ahn, a professor at Carnegie Mellon University, has looked for other “twofers”—ways to get people to supply bits of data that can serve two purposes. He devised it in a startup that he launched in 2012 called Duolingo. The site and smartphone app help people learn foreign languages—something he can empathize with, having learned English as a young child in Guatemala. But the instruction happens in a very clever way.

The company has people translate texts in small phrases at a time, or evaluate and fix other people’s translations. Instead of presenting invented phrases, as is typical for translation software, Duolingo presents real sentences from documents that need translation, for which the company gets paid. After enough students have independently translated or verified a particular phrase, the system accepts it—and compiles all the discrete sentences into a complete document. Among its customers are media companies such as CNN and BuzzFeed, which use it to translate their content in foreign markets. Like reCAPTCHA, Duolingo is a delightful “twin-win”: students get free foreign language instruction while producing something of economic value in return.

But there is a third benefit: all the “data exhaust” that Duolingo collects as a byproduct of people interacting with the site—information like how long it takes someone to become proficient in a certain aspect of a language, how much practice is optimal, the consequences of missing a few days, and so on. All this data, von Ahn realized, could be processed in a way that let him see how people learn best. It’s something we aren’t very easily able to do in a nondigital setting. But considering that in 2013 Duolingo had around one million visitors a day, who spent more than 30 minutes each on the site, he had a huge population to study.

The most important insight von Ahn has uncovered is that the very question “how people learn best” is wrong. It’s not about how “people” learn best—but which people, specifically. There has been little empirical work on what is the best way to teach a foreign language, he explains. There are lots of theories, positing that, say, one should teach adjectives before adverbs. But there is little hard data. And even when data exists, von Ahn notes, it’s usually at such a small scale—a study of a few hundred students, for example—that using it to reach a generalizable finding is shaky at best. Why not base a conclusion on tens of millions of students over many years? With Duolingo, this is now becoming possible.

Crunching Duolingo’s data, von Ahn spotted a significant finding. The best way to teach a language differs, depending on the students’ native tongue and the language they’re trying to acquire. In the case of Spanish speakers learning English, it’s common to teach pronouns early on: words like “he,” “she,” and “it.” But he found that the term “it” tends to confuse and create anxiety for Spanish speakers, since the word doesn’t easily translate into their language. So von Ahn ran a few tests. Teaching “he” and “she” but delaying the introduction of “it” until a few weeks later dramatically improves the number of people who stick with learning English rather than drop out.

Some of his findings are counterintuitive: women do better at sports terms; men lead them in cooking- and food-related words. In Italy, women as a group learn English better than men. And more such insights are popping up all the time.

The story of Duolingo underscores one of the most promising ways that big data is reshaping education. It is a lens into three core qualities that will improve learning: feedback, individualization, and probabilistic predictions.

Excerpted from Learning with Big Data: The Future of Education by Viktor Mayer-Schönberger and Kenneth Cukier (Houghton Mifflin Harcourt, 2014)

Tracker Pixel for Entry


  • BY KatyJordan

    ON August 18, 2014 01:15 PM

    The higher education has had a difficult time since the past few years due to lower student demand, regulatory restrictions and competitive environment. And now too many kids and parents don’t value education enough, not college, but just good basic education. Also American education needs new vision of teaching where there is less teacher talk and more student talk (if you are a student, you should click here to get more information), where teachers focus on how to help students take responsibility for their own learning. As a rule, when it comes to some college assignment, teachers don’t inspire an interest, they just force to do this for another grade. The old education system shoul be changed and modified in the best interests of children.

  • lewis paul's avatar

    BY lewis paul, sscscsc

    ON October 21, 2014 02:29 AM

    This article is very helpful for me. The essay writingcompany has wide knowledge about the education which covers my all question.

  • John Smith's avatar

    BY John Smith

    ON April 23, 2015 12:12 AM

    The higher education has had a worrying time following the previous years because of lower student request, managerial limitations and good environment. What’s more, now an excess of children and worth don’t instruction enough, not college, but just great major education.
    Someone write my essay -

  • I guess it’s a great book.But I am sure that a lot of student’s order superior paper at

  • BY John Noels

    ON May 31, 2015 09:39 PM

    Really thanks for the info which you gave me, keep me more update about the BIG DATA, as my company work on it only.

  • Student can only guess how hard it is sometimes to find and complete a serious task at university.As a former student every man can name several logical reasons for youngsters to look for writing research papers during their studies.

  • Joseph Wilson's avatar

    BY Joseph Wilson

    ON August 8, 2015 02:31 AM

    and this was very surprising to me , at last got to know how this CAPTCHA was invented and by who, still this man is a super genius and a relaxed life enjoying person i can see from this post.

  • StewartStone's avatar

    BY StewartStone

    ON August 25, 2015 10:04 AM

    Global education leader Houghton Mifflin Harcourt today launched Learning with Big Data: The Future of Education, an e-short by Viktor Mayer-Schonberger and Kenneth Cukier, authors of the New York Times bestselling book Big Data: A Revolution That Will Transform How We Live, Work, and Think. The short e-book vividly illustrates the transformative effect big data is having on the learning process and sheds new light on visionary individuals and companies that have marshaled big data to enhance the way we learn.

  • BY Edward Warren

    ON September 7, 2015 02:18 AM

    Interesting thought about big data. But if you ever want to download a free mp3 songs then you should got to

  • BY Steve Hogard

    ON December 21, 2015 09:23 PM

    Thanks for sharing information.

  • BY Steve Hogard

    ON December 21, 2015 09:28 PM

    Amazing book worth to read.

  • Education will be the motivation for the people who wish to make a move exclusive inside their life. Students are usually blessed those write my thesis services who have the required time to choose the best education and learning which make people ideal inside their life. So always require the very best education and learning that have value to fulfill your needs.

Leave a Comment


Please enter the word you see in the image below:


SSIR reserves the right to remove comments it deems offensive or inappropriate.

Helping Children Succeed: What Works and Why

By Paul Tough

Building on his previous work about the importance of personal traits such as perseverance in student success, Paul Tough focuses Helping Children Succeed on how educators, policymakers, and parents can help children develop those attributes.