2
24 | NewScientist | 25 February 2012 TECHNOLOGY or just make you feel trapped. “Today’s internet search tells us what the world already knows,” explains the CEO of YossarianLives, J. Paul Neeley. “We don’t want you to know what everyone else knows, we want you to generate new knowledge.” He says that metaphors help us see existing TYPING “love” into Google, I find the Wikipedia entry, a “relationship calculator” and Lovefilm, a DVD rental service. Doing the same in YossarianLives, a new search engine due to launch this year, I might receive quite different results: “river”, “sleep” and “prison”. Its creators claim YossarianLives is a metaphorical search engine, designed to spark creativity by returning disparate but conceptually related terms. So the results perhaps make sense if you accept that love can ebb and flow, provide rejuvenating comfort A mad idea, but it just might workJacob Aron PARAMOUNT/FILMWAYS/THE KOBAL COLLECTION < long time,” says geneticist and blogger Daniel MacArthur. Oxford Nanopore is set to face stiff competition, however. Jonathan Rothberg, a scientist and entrepreneur who founded rival firm 454 Life Sciences, also announced at the AGBT conference that his start-up company, Ion Torrent, will be launching a desktop sequencing machine. The Ion Proton identifies bases by using transistors to detect hydrogen ions as they are given off during the polymerisation of DNA. This device will be capable of sequencing a human genome in 2 hours for around $1000, Rothberg claims. Nanopores are an “elegant” technology, he says, but Ion Torrent already has a foot in the door. “As we saw last summer with the E. coli outbreak in Germany, people are already using it,” he says. By contrast, the MinION would take about 6 hours to complete a human genome, Brown claims, though the company plans to market the device for use in shorter sequencing tasks like identifying pathogens, or screening for genetic mutations that can increase the risk of certain diseases. Each unit is expected to cost $900 when it goes on sale later this year. GridION can sequence a human genome in around 15 minutes. “The biggest strength of nanopore sequencing is that it generates very long reads, which has been a limitation for most other technologies,” says Loman. As for clinical applications, David Rasko at the Institute for Genome Sciences at the University of Maryland in Baltimore, says the MinION could have huge benefits. “It may have serious implications for public health and it could really change the way we do medicine,” he says. “You can see every physician walking around the hospital with a pocketful of these things.” It is likely to increase the number of scientists generating sequencing data, he adds. n Your phone knows what you are about to say... SMARTPHONES may soon get a lot better at finishing your sentences for you – with the help of words and phrases gleaned from crowdsourcing. The software packages in today’s phones often struggle with texts and voice commands if a user attempts words or phrases that aren’t included in the phone’s database. To see if the crowd could help, Keith Vertanen of Montana Tech in Butte and Per Ola Kristensson at the University of St Andrews, UK, called upon workers of the Amazon Mechanical Turk. The plan was to try and improve a predictive system used in Augmented and Alternative Communication (AAC) devices, which help disabled people to communicate by painstakingly typing out words interpreted from their muscle twitches or blinks. The researchers paid 298 Mechanical Turk workers to imagine phrases they might need if they had motor neuron disease or cerebral palsy. Their responses produced nearly 6000 useful phrases. Next, by trawling through Twitter postings and other social media texts the workers extracted sentences and phrases with similar structures to the initial phrases, expanding the corpus to tens of millions of entries. The result is a system that needs 11 per cent fewer keystrokes than a standard AAC device. For anyone struggling over every word it is a big improvement, says Fly off at a tangent “Metaphorical search engine” named after Catch-22 ’s anti-hero sparks creativity

Crowdsourcing improves predictive texting

  • Upload
    duncan

  • View
    214

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Crowdsourcing improves predictive texting

24 | NewScientist | 25 February 2012

TECHNOLOGY

or just make you feel trapped.“Today’s internet search tells

us what the world already knows,” explains the CEO of YossarianLives, J. Paul Neeley. “We don’t want you to know what everyone else knows, we want you to generate new knowledge.” He says that metaphors help us see existing

TYPING “love” into Google, I find the Wikipedia entry, a “relationship calculator” and Lovefilm, a DVD rental service. Doing the same in YossarianLives, a new search engine due to launch this year, I might receive quite

different results: “river”, “sleep” and “prison”. Its creators claim YossarianLives is a metaphorical search engine, designed to spark creativity by returning disparate but conceptually related terms. So the results perhaps make sense if you accept that love can ebb and flow, provide rejuvenating comfort

–A mad idea, but it just might work–

Jacob Aron

par

am

Ou

NT/

fiLm

waY

s/TH

E kO

baL

COLL

ECTi

ON

< long time,” says geneticist and blogger Daniel MacArthur.

Oxford Nanopore is set to face stiff competition, however. Jonathan Rothberg, a scientist and entrepreneur who founded rival firm 454 Life Sciences, also announced at the AGBT conference that his start-up company, Ion Torrent, will be launching a desktop sequencing machine. The Ion Proton identifies bases by using transistors to detect hydrogen ions as they are given off during the polymerisation of DNA.

This device will be capable of sequencing a human genome in 2 hours for around $1000, Rothberg claims. Nanopores are an “elegant” technology, he says, but Ion Torrent already has a foot in the door. “As we saw last summer with the E. coli outbreak in Germany, people are already using it,” he says.

By contrast, the MinION would take about 6 hours to complete a human genome, Brown claims, though the company plans to market the device for use in shorter sequencing tasks like identifying pathogens, or screening for genetic mutations that can increase the risk of certain diseases. Each unit is expected to cost $900 when it goes on sale later this year. GridION can sequence a human genome in around 15 minutes.

“The biggest strength of nanopore sequencing is that it generates very long reads, which has been a limitation for most other technologies,” says Loman.

As for clinical applications, David Rasko at the Institute for Genome Sciences at the University of Maryland in Baltimore, says the MinION could have huge benefits. “It may have serious implications for public health and it could really change the way we do medicine,” he says. “You can see every physician walking around the hospital with a pocketful of these things.” It is likely to increase the number of scientists generating sequencing data, he adds. n

Your phone knows what you are about to say...SMARTPHONES may soon get a lot better at finishing your sentences for you – with the help of words and phrases gleaned from crowdsourcing.

The software packages in today’s phones often struggle with texts and voice commands if a user attempts words or phrases that aren’t included in the phone’s database.

To see if the crowd could help, Keith Vertanen of Montana Tech in Butte and Per Ola Kristensson at the University of St Andrews, UK, called upon workers of the Amazon Mechanical Turk. The plan was to try and improve a predictive system used in Augmented and Alternative Communication (AAC) devices, which help disabled people to communicate by painstakingly typing out words interpreted from their muscle twitches or blinks.

The researchers paid 298 Mechanical Turk workers to imagine

phrases they might need if they had motor neuron disease or cerebral palsy. Their responses produced nearly 6000 useful phrases. Next, by trawling through Twitter postings and other social media texts the workers extracted sentences and phrases with similar structures to the initial phrases, expanding the corpus to tens of millions of entries.

The result is a system that needs 11 per cent fewer keystrokes than a standard AAC device. For anyone struggling over every word it is a big improvement, says

fly off at a tangent“metaphorical search engine” named after Catch-22 ’s anti-hero sparks creativity

120225_N_TechOpenSpread.indd 24 20/2/12 17:30:01

Page 2: Crowdsourcing improves predictive texting

25 February 2012 | NewScientist | 25

For more technology stories, visit newscientist.com/technology

concepts in a new way and create innovative ideas. For example, using a Formula 1 pit crew as a metaphor for doctors in an emergency room has helped improve medical procedures. YossarianLives aims to create new metaphors for designers, artists, writers or even scientists.

The name is derived from the anti-hero of the novel Catch-22, as the company wants to solve the catch-22 of existing search engines, which they say help us to access current knowledge but also harm us by reinforcing that knowledge above all else.

Neeley won’t reveal exactly how the engine works, but says they aren’t directly teaching the

system any metaphors. Instead, they are using statistical natural-language processing techniques similar to those employed by Google and other search engines. These methods map out the relationship between words, putting closely related concepts such as “dog” and “bone” near to each other while placing unrelated terms such as “dog” and “ironing board” further apart. While a regular search engine typically returns the nearest terms, Neeley says YossarianLives looks for words that are further

away but still share a linking conceptual structure.

The search engine’s success depends on divining such connections. Phil Blunsom, a researcher in computational linguistics at the University of Oxford, is sceptical. “Detecting metaphors is pretty difficult in itself, mapping between them is very difficult, and to do this with enough accuracy to be usable seems a bit hopeful,” he says.

One version of the search engine creates a list of words based on an internet search, but only returns a single image representing one of those words, after searching for appropriately tagged photos on Flickr.

The results can be ambiguous. When New Scientist tested the system and searched for “science” the image of a happy couple was returned, which Neeley says could indicate “chemistry”. As such, the user seems to be the one creating the metaphorical connection rather than the software – suggesting that perhaps random results might be just as good for sparking creativity, though Neeley insists that isn’t the case.

The company is now testing its algorithms to determine the best balance point between the literal results of the type returned by Google and random results. “There’s this really nice sweet spot where results are disparate but close enough that people can make that leap,” says Neeley. n

“Detecting metaphors is pretty difficult in itself, mapping between them is even more difficult”

One Per Cent

SCa

lad

O

Magic away photo intrudersHas a stream of schoolchildren or drunken stag party just ruined your holiday snap of Rome’s Trevi fountain? A new smartphone app promises to let you edit out such intruders in your photos, leaving the beautiful view intact. Called Remove, the upcoming app, developed by Swedish photography firm Scalado, takes a burst of shots of your scene. It identifies the objects which are moving, based on their relative position in each frame. These objects are highlighted and you can delete the ones you don’t want. Scalado will showcase the app at the 2012 Mobile World Conference in Barcelona, Spain, this month.

Interactive maps let tribes keep tabsOnline games and interactive maps could help pygmy tribes in Africa fight logging and poaching in their area. In 2009, anthropologist Jerome Lewis developed a handheld device that let tribes geotag important trees in their area to create online maps. Now Lewis is building a platform for online games to teach indigenous people how to read and interact with these maps using smartphones. One game, Blindate, requires map-reading in order for two players to meet one another. Its aim is to help tribe members get used to locating themselves on maps.

Tweet me no politicsPeople on Twitter link to others who share their interests, but not always their political views. Yaneer Bar-Yam at the New England Complex Systems Institute in Cambridge, Massachusetts, collected 521,733 tweets posted by 223,950 users who tweeted at least three links to articles in The New York Times between 14 and 29 September 2011. The team then used a mathematical model to work out who tweeted to whom, where and about what. Rather than around politics, people clustered mainly according to whether their links and interests were global, US-wide, or just within New York City. Of the 10 most popular subjects US-wide, science came ninth.

For breaking tech news go to: newscientist.com/onepercent

Kristensson. It should also work when applied to more standard text and speech recognition systems, he says. The work appears in the Proceedings of the 2011 Conference on Empirical Method in Natural Language Processing.

David Weir at the University of Sussex, UK, agrees that the system has potential. “Getting adequate quantities of good quality data to build statistical [language] models is one of the most significant challenges in this area,” he says. duncan Graham-rowe n–Here to help–

Get

ty

120225_N_TechOpenSpread.indd 25 20/2/12 17:30:16