Making search better in Catalonia, Estonia, and everywhere else

Filed under: Official Google Blog — Wrote by Lees on Sunday, May 4th, 2008 @ 2:27 am

Posted by Paul Haahr and Steve Baker,
Software Engineers, Search Quality

We recently began a
series of posts on how we harness the power of data. href="http://googleblog.blogspot.com/2008/03/why-data-matters.html"
id="m2px" >Earlier we told
you how data has
been critical to the advancement of search; about using data to href="http://googleblog.blogspot.com/2008/03/using-log-data-to-help-keep-you-safe.html"
id="cxjy" >make our
products safe and to href="http://googleblog.blogspot.com/2008/03/using-data-to-help-prevent-fraud.html"
id="thga" >prevent fraud;
this post is the newest in
the series. -Ed.

One of the most important uses of data at Google is building
language models. By analyzing how people use language, we build
models that enable us to interpret searches better, offer spelling
corrections, understand when alternative forms of words are needed,
offer >language
translation
, and even href="http://googleblog.blogspot.com/2007/05/search-without-boundaries.html"
id="e-d4"
>suggest
when searching in another language is appropriate.

One place we use these models is to find alternatives for words
used in searches. For example, for both English and French users,
"GM" often means the company "General Motors,"
but our language model understands that in French searches like href="http://www.google.fr/search?hl=fr&q=seconde GM" id="myy8"
>seconde GM, it means "Guerre
Mondiale" (World War), whereas in href="http://www.google.fr/search?hl=fr&q=STI GM" id="yz1y"
>STI GM it means "Génie Mécanique"
(Mechanical Engineering). Another meaning in English is
"genetically modified," which our language model
understands in href="http://www.google.com/search?q=GM corn&hl=en" id="rf9x"
>GM corn. We've learned this based on the
documents we've seen on the web and by observing that users
will use both "genetically modified" and "GM"
in the same set of searches.

We use similar techniques in all languages. For example, if a
Catalan user searches for href="http://www.google.es/search?hl=ca&q=resultat elecció barris BCN"
id="w3e2" >resultat elecció barris BCN
(searching for the result of a neighborhood election in Barcelona),
Google will also find pages that use the words
"resultats" or "eleccions" or that talk about
"Barcelona" instead of "BCN." And our language
models also tell us that the Estonian user looking for href="http://www.google.ee/search?hl=et&q=Tartu juuksur"
id="b8_i" >Tartu juuksur, a barber in
Tartu, might also be interested in a "juuksurisalong," or
"barber shop."

In the past, language models were built from dictionaries by hand.
But such systems are incomplete and don't reflect how people
actually use language. Because our language models are based on
users' interactions with Google, they are more precise and
comprehensive — for example, they incorporate names, idioms,
colloquial usage, and newly coined words not often found in
dictionaries.

When building our models, we use billions of web documents and as
much historical search data as we can, in order to have the most
comprehensive understanding of language possible. We analyze how
our users searched and how they revised their searches. By looking
across the aggregated searches of many users, we can infer the
relationships of words to each other.

Queries are not made in isolation — analyzing a single search in
the context of the searches before and after it helps us understand
a searcher's intent and make inferences. Also, by analyzing how
users modify their searches, we've learned related words,
variant grammatical forms, spelling corrections, and the concepts
behind users' information needs. (We're able to make these
connections between searches using cookie IDs — small pieces of
data stored in visitors' browsers that allow us to distinguish
different users. To understand how cookies work, href="http://youtube.com/watch?v=XfZLztx8cKI"
id="e0o:" >watch this video.)

To provide more relevant search results, Google is constantly
developing new techniques for language modeling and building better
models. One element in building better language models is href="http://googleblog.blogspot.com/2005/08/machines-do-translating.html"
id="z0.8" >using more data collected over
longer periods of time. In languages with many documents and users,
such as English, our language models allow us to improve results
deep into the "long tail" of searches, learning about
rare usages. However, for languages with fewer users and fewer
documents on the web, building language models can be a challenge.
For those languages we need to work with longer periods of data to
build our models. For example, it takes more than a year of
searches in Catalan to provide a comparable amount of data as a
single day of searching in English; for Estonian, more than two and
a half years worth of searching is needed to match a day of
English. Having longer periods of data enables us to improve search
for these less commonly used languages.

At Google, we want to ensure that we can help users everywhere find
the things they're looking for; providing accurate, relevant
results for searches in all languages worldwide is core to
Google's mission. Building extensive models of historical usage
in every language we can, especially when there are few users, is
an essential piece of making search work for everyone, everywhere.

Tags: , , , , , , ,

New C-SPAN channel on YouTube

Filed under: Official Google Blog — Wrote by Lees on Sunday, May 4th, 2008 @ 2:26 am

Posted by Steve Grove, Head of News and
Politics, YouTube

As the 2008 election progresses, more and more voters are tuning
into YouTube to stay on top of the action. Our href="http://www.youtube.com/youchoose" >You Choose '08
platform now features content from candidates, news organizations,
and voters, and we've made it easier than ever to see where the
candidates stand on each of the major issues in this election. The
next big stop on the campaign trail is Pennsylvania, so we're
partnering with C-SPAN to collect videos from voters across the
country who will answer the question, "What is the most
important issue to you in this election?"

value="http://www.youtube.com/v/QPZr6EPIXxI&hl=en" /> type="application/x-shockwave-flash" wmode="transparent"
height="355" width="425" />

This is our fourth voter video program. We started in href="http://www.youtube.com/iowacaucuses" >Iowa, went on to href="http://www.youtube.com/nhprimaries" >New Hampshire, and
then went national on href="http://www.youtube.com/supertuesday" >Super Tuesday. With
C-SPAN, we're adding a new twist: in the week leading up to the
Pennsylvania primaries on April 22, we'll be on the C-SPAN
election bus throughout the state, collecting videos straight from
the campaign trail.

We think C-SPAN is the perfect partner for this program. Started in
1979, C-SPAN is a private nonprofit whose mission is to
"provide public access to the political process." That
mission is closely aligned with our own: to connect voters and
candidates through the power of online video. In a way, YouTube
politics has given voters everywhere the opportunity to create
their own "C-SPANs" and make the election all the more
transparent and accessible to voters everywhere. And that's a
fundamentally good thing. So tune into href="http://www.youtube.com/cspan" >C-SPAN's YouTube
channel in the coming days to check out the videos we're
getting from voters — and to submit your own

Tags: , , , , , , ,

The year in YouTube politics

Filed under: Official Google Blog — Wrote by Lees on Sunday, April 27th, 2008 @ 1:36 am

Posted by Steve Grove, Head of YouTube
News and Politics

Candidates caught
singing on camera
. href="http://youtube.com/watch?v=nQi7PaYKqTU" >Dorm-room
presidential interviews. A martial arts master href="http://youtube.com/watch?v=EjYv2YW6azE" >endorsing a
presidential candidate. Citizen-created campaign href="http://youtube.com/watch?v=6h3G-lMZxjo" >commercials. And
two presidential
primary debates
. 2007 was quite a year for YouTube
Politics.

At this time last year, YouTube had developed a reputation as a
place where href="http://youtube.com/watch?v=r90z0PMnKwI" >"gotcha!"
videos posted by ciitzens were changing the political
landscape. Some even called the 2006 U.S. midterm elections
"the YouTube election" after several candidates were
caught on camera saying things they probably shouldn't have.
But in 2007, that changed. Seizing the YouTube opportunity,
presidential candidates came rushing to the platform themselves,
setting up official campaign channels on our href="http://youtube.com/youchoose" >You Choose '08
platform. Seven of the 16 presidential candidates even announced
their candidacies on YouTube.

Candidates and voters now speak to each other through video. At
their best, campaigns use YouTube not as a shrunken TV screen
through which to distribute their soundbites, but as a window
through which to have a dialogue with the American people.
YouTube's leveling effect is this: anyone can upload a video
with their political message, and the best content rises to the top
through community view counts, rankings, linkages, and embeds. Any
voter with a video camera and access to the Internet has the
opportunity to be seen and heard.

Our two presidential debates with CNN highlighted this phenomenon.
Eight thousand video questions were submitted for the two
record-setting debates, which opened up a traditionally closed
event to the rest of the world via YouTube. Time was, you had to be
in New Hampshire, Iowa, or Florida to get access to candidates at a
debate. With questions coming directly from voters via video, our
YouTube debates helped to break down some of the geographical
barriers that have so sharply defined American politics in the
past.

value="http://www.youtube.com/v/JEmulj-6tgQ&rel=1" /> type="application/x-shockwave-flash" wmode="transparent"
width="425" height="355" />

So what's ahead in 2008? Things are only going to get more
exciting. As Congressional and Senate races heat up, you'll see
more and more candidates coming to YouTube. And as the presidential
races narrows down to two candidates, YouTube will be a critical
battlefront in the general election. With voters, candidates, issue
groups, media companies, trade associations, lobbyists and
activists all interacting on the same level platform, 2008 promises
to be a true "YouTube Election.

Tags: , , , , , , ,

Spotlight on Seattle

Filed under: Official Google Blog — Wrote by Lees on Thursday, December 13th, 2007 @ 2:52 am

Posted by Steve Yegge, Software
Engineer

Our Seattle/Kirkland engineering office just celebrated its third
birthday! And like our friends in New York href="http://googleblog.blogspot.com/2007/10/if-you-build-it-they-will-eat-it.html" >
who recently commemorated an anniversary, we too had a very
special cake to mark the occasion, as you can see here.

onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"
href="http://bp3.blogger.com/_7ZYqYi4xigk/RxkVNYw7qtI/AAAAAAAAAKo/R3G7ISrQmlc/s1600-h/kirkcake2.jpg" >
height="1" width="1" />

Tags: , , , , , , ,

Calling all developers: $10M Android challenge

Filed under: Official Google Blog — Wrote by Lees on Wednesday, December 12th, 2007 @ 9:57 pm

Posted by Steve Horowitz, Engineering
Director

Last week we href="http://googleblog.blogspot.com/2007/11/wheres-my-gphone.html" >
announced the href="http://www.openhandsetalliance.com/" >Open Handset
Alliance, a group of mobile and technology leaders committed to
improving the mobile experience and href="http://www.youtube.com/watch?v=6rYozIZOgDk" >Android, the
first truly open and comprehensive platform for mobile
devices.

Today, the team is releasing an early look at the href="http://code.google.com/android/" >Android SDK for
developers interested in building applications for Android. To get
things rolling, we've also announced the href="http://code.google.com/android/adc.html" >Android Developer
Challenge, which provides $10 million in awards for developers
who build great applications for Android. Read more on the new href="http://android-developers.blogspot.com/" >Android Developers
blog to learn about this exciting mobile platform.

With so many brilliant minds striving to design engaging,
innovative applications, mobile users around the world (3 billion
and counting!) can expect phones equipped with dynamic and
unprecedented applications very soon. height="1" width="1" />

Tags: , , , , , , ,

Lost? No, found!

Filed under: Official Google Blog — Wrote by Lees on Wednesday, December 12th, 2007 @ 9:55 pm

Posted by Steve Lee, Product Manager,
Google Mobile team

We know a lot of you are using Google Maps for mobile to view maps
and satellite imagery, find local businesses, and get directions.
But to date not many of you have been able to take advantage of the
increased speed and convenience that location information from
technologies like GPS can afford, if only because there are very
few GPS-enabled devices on the market.

Well, when it comes to location information, GPS is no longer the
only game in town. Today we released a version of Google Maps for
mobile with a new beta technology called href="http://www.google.com/mobile/gmm/mylocation/index.html" >My
Location, which provides approximate location information for
those of us without GPS, and complements GPS location information
for those who do have it. Head over to our href="http://googlemobile.blogspot.com/2007/11/new-magical-blue-circle-on-your-map.html" >
new mobile blog to learn more. height="1" width="1" />

Tags: , , , , , , ,

The state of our video ID tools

Filed under: Official Google Blog — Wrote by Lees on Tuesday, December 11th, 2007 @ 6:18 am

Posted by Steve Chen, YouTube
co-founder

Recent speculation and stories like this href="http://online.wsj.com/article/SB118161295626932114.html" >Wall Street Journal article
or this href="http://www.reuters.com/article/wtMostRead/idUSWEN871820070612" >
Reuters report on YouTube's use of video identification
tools made us think it would be useful to clarify what we’re doing.

We’ve been developing improved content identification for months,
and we’re confident that in the not-too-distant future, we’ll
unveil an innovative solution that will work for users and content
creators alike. This is one of the most technologically complicated
tasks that we have ever undertaken. But YouTube has always been
committed to developing sustainable and scalable tools that work
for all content owners.

Even though we haven’t given too many details, we’ve been hard at
work. Earlier this year we implemented audio fingerprinting
technology from Audible Magic, to help identify the href="http://www.youtube.com/audioswap_main" >audio
content
of music partners like Warner Music, Sony BMG, and
Universal. Today we're experimenting with video identification
tools, and will share with you a few core principles driving our
technology development, past and present.

We are beginning tests on an automated system to identify and match
specific videos. The technology extracts key visual aspects of
uploaded videos and compares that information against reference
material provided by copyright holders. Achieving the accuracy to
drive automated policy decisions is difficult, and requires a
highly tuned system. Once accuracy is achieved, the challenge
becomes speed and scale to support the millions of people who use
YouTube every day. We are working with some of the major media
companies to test what we have developed. We’re excited about the
progress so far, and we’re dedicated to making these tests
successful, but as always with cutting-edge technologies, there’s
no guarantee of success.

Now, when it comes to spotting pornography and graphic violence,
and other content prohibited by our href="http://www.youtube.com/t/terms" >terms of use, nothing
beats our community flagging. Once a user flags a video, we
immediately review it and remove it if we find a violation. But our
community can’t identify infringing content. We all know
pornography and violence when we see them. But href="http://youtube.com/t/howto_copyright" >copyright status
can only be determined by the copyright holder. That is because
almost anyone who creates an original video has the copyright for
that work, and such a wide range of copyright holders'
preferences vary widely.

Some copyright holders want control over every use of their
creation. Many professional artists and media companies post their
latest videos without telling us, while some home video-makers
don't want their stuff online. Some legal departments take down
a video one day and the marketing department puts it up the next.
Which is their right, but our community can’t predict those things,
and neither can we. The same is true for technology. No matter how
good our video identification technology gets, it will never be
able to read copyright-holders’ minds.

If a content owner identifies material that she doesn’t want on
YouTube, she can request its removal with the click of a mouse. If
particular users repeatedly infringe copyrights, we terminate their
accounts. We have long made a practice of creating a unique
"hash" of every video removed for alleged copyright
infringement and blocking re-uploads of the hash. We href="http://youtube.com/t/community_guidelines" >educate users
on what is and isn’t permissible under the law. Our upcoming video
identification system will be our latest way of empowering
copyright holders, going above and beyond legal requirements.

We’ll continue our focus on delivering a great user experience.
YouTube's no-fuss upload lets video artists collapse the gap
between the creative moment and its worldwide publication. It helps
our hundreds of media partners - as well as marketers and
advertisers - spread their hottest work while it's still hot.
And it enables presidential candidates participating in our
YouChoose 2008 program to engage in a direct, open dialogue with
voters, bringing transparency, access and authenticity to the
political process. We’re carefully designing our new identification
technologies to not impede those free and fast forms of
expression.

In conclusion, a content management system has to have technology
that provides high quality matching and detection, but it also has
to apply business rules in ways that support the business
objectives of partners while providing high quality user
experiences. With the introduction of our video identification
tools, YouTube will continue to be the leader in online video, and
the premier destination for watching and sharing original videos
worldwide. Now, back to href="http://youtube.com/testtube" >work…

Update: Added direct link
to Wall Street Journal story.

Tags: , , , , , , ,

International Cleanup Weekend: Think globally, clean locally

Filed under: Official Google Blog — Wrote by Lees on Friday, December 7th, 2007 @ 2:14 am

Posted by Steve Miller, Google Earth
Outreach

On October 13th and 14th, Googlers and many
people around the world will head out to clean up local parks,
beaches, trails and other places close to home. We'll be
planning our cleanups using href="http://maps.google.com/" >Google Maps and sharing our
plans with friends and families, along with an invitation to help.
So far, Googlers have sent in almost 100 cleanup maps and proposed
plans, and have invited more than 900 of their personal contacts to
help.

And since many small cleanups add up to one big impact, we hope
you'll join in too. It takes just a few minutes to plan your
own cleanup, make a map of it, and send your map to us — we'll
add it to a growing map of all the cleanups around the world. If
you keep your cleanup small (groups of 6-10 people work best) and
close to home, it'll be easy to organize and you can be sure
that you're doing what's most important for your
neighborhood. After your cleanup, share your work with the world by
posting photos and videos to your map.

If you'd like to join this global effort, you can href="http://maps.google.com/help/maps/cleanup/" >get started
here. height="1" width="1" />

Tags: , , , , , , ,

Cingular BlackBerry 8800 has Google Maps and GPS

Filed under: Official Google Blog — Wrote by Lees on Monday, December 3rd, 2007 @ 10:10 pm

Posted by Steve Lee, Product Manager,
Google Maps for mobile

Some of us have a great sense of direction, and others find
themselves, well, a little lost at times. For those in the latter
camp, you can thank Cingular for launching the href="http://business.cingular.com/businesscenter/blackberry8800/?_requestid=182418"onClick="javascript:urchinTracker('/outgoing/cingular_com'">
BlackBerry® 8800, the latest open GPS-enabled device from a
major U.S. carrier. That means that when you use

href="http://www.google.com/gmm/gps.html?utm_source=en-et-blog&utm_medium=et&utm_source=en">
Google Maps for mobile
, your location automatically shows up on
the map.

When you download Google Maps for mobile and fire it up, you'll
notice something quite unusual: a blinking blue dot showing you
exactly where you are! You can use your auto-detected location to
get directions and perform local searches without even entering
your location — instead of "pizza 94043", just enter
"pizza" — and we'll automatically know you want
pizza in the zip code "94043."

So here's a great big hats off to Cingular — this BlackBerry®
8800 with GPS is awesome!

Tags: , , , , , , ,

Cingular BlackBerry 8800 has Google Maps and GPS

Filed under: Official Google Blog — Wrote by Lees on Sunday, December 2nd, 2007 @ 11:12 pm

Posted by Steve Lee, Product Manager,
Google Maps for mobile

Some of us have a great sense of direction, and others find
themselves, well, a little lost at times. For those in the latter
camp, you can thank Cingular for launching the href="http://business.cingular.com/businesscenter/blackberry8800/?_requestid=182418"onClick="javascript:urchinTracker('/outgoing/cingular_com'">
BlackBerry® 8800, the latest open GPS-enabled device from a
major U.S. carrier. That means that when you use

href="http://www.google.com/gmm/gps.html?utm_source=en-et-blog&utm_medium=et&utm_source=en">
Google Maps for mobile
, your location automatically shows up on
the map.

When you download Google Maps for mobile and fire it up, you'll
notice something quite unusual: a blinking blue dot showing you
exactly where you are! You can use your auto-detected location to
get directions and perform local searches without even entering
your location — instead of "pizza 94043", just enter
"pizza" — and we'll automatically know you want
pizza in the zip code "94043."

So here's a great big hats off to Cingular — this BlackBerry®
8800 with GPS is awesome!

Tags: , , , , , , ,

Copyright © 2007 Google Adsense College.
Powered by GoogleSchool. All Rights Reserved.