Moving to Unicode 5.1

Filed under: Official Google Blog — Wrote by Lees on Tuesday, May 13th, 2008 @ 10:04 pm

Posted by Mark Davis, Senior
International Software Architect

Google has just begun supporting href="http://www.unicode.org/press/pr-5.1.html" id="yh5d" >Unicode
5.1, less than one month after it was released. It's now
available in search, so people speaking languages such as Malayalam
can now search for words containing the new characters in Unicode
5.1.

Web pages can use a variety of different character encodings, like
ASCII, Latin-1, or Windows 1252, or href="http://www.unicode.org/book/aboutbook.html#Foreword"
id="c42w" >Unicode. Most encodings can only represent a few
languages, but Unicode will handle anything from Chinese to French
to Arabic. We have long used Unicode as the internal format for all
the text we search: any other encoding is first converted to
Unicode for processing. So we regularly update to each new version
of Unicode (and relevant related standards like href="http://unicode.org/cldr/" id="az_w" >CLDR and href="http://www.rfc-editor.org/rfc/bcp/bcp47.txt"
id="da92" >BCP 47) to make sure we are current. Thus Unicode
plays a key role in our href="http://www.google.com/corporate/" >mission.

Uptick in native Unicode webpages

Just last December there was an interesting
milestone on the web. For the first time, we found that Unicode was
the most frequent encoding found on web pages, overtaking both
ASCII and Western European encodings—and by coincidence, within 10
days of one another. What's more impressive than simply
overtaking them is the speed with which this happened; take a look
at the blue line in this graph.

onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"
href="http://bp1.blogger.com/_Ap14FtNN91w/SBzrtHJfLnI/AAAAAAAAA5U/TV7_g2_sWq0/s1600-h/Unicode2.gif" >
alt="" id="BLOGGER_PHOTO_ID_5196287230324190834"
border="0" />

You can see a long-term decline in pages encoded in
ASCII (unaccented letters A through Z). More recently, there's
been a significant drop in the use of encodings covering only
Western European letters (ASCII and a few accented letters like Ä,
Ç, and Ø). We're seeing similar declines in other
language-specific encodings. Unicode, on the other hand, is showing
a sharp increase in usage.

This is based on our indexing of web pages, and thus may vary
somewhat from what other search engines find. However, the trends
are pretty clear, and the continued rise in use of Unicode makes it
even easier to do the processing for the many languages that we
cover.

href="http://feeds.feedburner.com/~f/blogspot/MKuf?a=QxeygH" > border="0" />

height="1" width="1" />

Tags: , , , , , , ,

Building the Indic web

Filed under: Official Google Blog — Wrote by Lees on Tuesday, May 13th, 2008 @ 10:04 pm

Posted by Naren Manappa, Software
Engineer, and Melchi Sundararaj, Member of Technical
Team

We have come a long way from our

href="http://googleblog.blogspot.com/2007/04/now-you-can-blog-in-hindi.html"
id="kb.v" >first Indic transliteration release
to our current
support for transliteration in 5 languages — Hindi, Kannada,
Malayalam, Tamil and Telugu — for a broader set of Google
products. We are also happy to release our very first

href="http://www.google.com/translate_t?langpair=en|hi"
id="v92_" >English to Hindi translation service
. Read on to
understand how you can use these services to create, communicate
and search in your language, and more.

Type in your language easily on our

href="http://www.google.co.in/transliterate/indic" id="vsjs" >Indic
Transliteration Labs page
.

Add the href="http://www.google.com/ig/adde?moduleurl=www.google.com/ig/modules/indic_transliteration.xml" >
transliteration gadget to your iGoogle page.

onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"
href="http://bp3.blogger.com/_Ap14FtNN91w/SCCXrnJfLoI/AAAAAAAAA5g/Q53oFoz7y9s/s1600-h/Indic1.gif" >
alt="" id="BLOGGER_PHOTO_ID_5197320745484496514"
border="0" />

Express your views and create more content: href="http://www.blogger.com/indic/hi" id="oy3z" >Blogger.

onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"
href="http://bp1.blogger.com/_Ap14FtNN91w/SCCZ0HJfLpI/AAAAAAAAA5o/Irch2xs-SL0/s1600-h/Indic2.gif" >
alt="" id="BLOGGER_PHOTO_ID_5197323090536640146"
border="0" />

Scrap your friends in your language: href="http://www.orkut.com/scrapbook.aspx"
id="q4ef" >orkut.

onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"
href="http://bp1.blogger.com/_Ap14FtNN91w/SCCaIHJfLqI/AAAAAAAAA5w/9H4lxvM2JhQ/s1600-h/Indic3.gif" >
alt="" id="BLOGGER_PHOTO_ID_5197323434134023842"
border="0" />

Find information: href="http://www.google.co.in/hi" id="m0nc" >Google Suggest in
your language.

onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"
href="http://bp3.blogger.com/_Ap14FtNN91w/SCCaenJfLrI/AAAAAAAAA54/ISqt3BB4gho/s1600-h/Indic4.gif" >
alt="" id="BLOGGER_PHOTO_ID_5197323820681080498"
border="0" />

You can now also try out our brand
new href="http://www.google.com/translate_t?langpair=en|hi"
id="js74" >English to Hindi translation service, and the href="http://www.google.com/translate_s"
id="wcru" >translated search feature that lets
you query in Hindi, obtain search results for the translated
query in English, and then see the Hindi translations of these
results.

onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"
href="http://bp0.blogger.com/_Ap14FtNN91w/SCCa13JfLsI/AAAAAAAAA6A/WZlX501wnCE/s1600-h/Indic5.gif" >
alt="" id="BLOGGER_PHOTO_ID_5197324220113039042"
border="0" />

For more information on all of
these, read href="http://www.google.co.in/intl/en/press/pressrel/20080505_googletranslate_hindi.html"
id="ytl0" >our press release.

href="http://feeds.feedburner.com/~f/blogspot/MKuf?a=2JA8iH" > border="0" />

height="1" width="1" />

Tags: , , , , , , ,

Search within a site: A tale of teleportation

Filed under: Official Google Blog — Wrote by Lees on Sunday, May 4th, 2008 @ 2:31 am

Posted by Ben Lee, Software Engineer,
and Jack Menzel, Product Manager

Have you ever forgotten the exact address of a site that you wanted
to visit? Not a problem - just type the name of the site into the
Google search box and hopefully it appears at the top of the search
results page.

We call this "teleporting", and we're pleased that we
have been able to minimize the need to remember an alphabet soup of
.coms, .nets, and .orgs out of everyone's lives. However, one
of the trends we noticed while studying teleporting was that there
were lots of searchers who would type the name of a specific
website as if they wanted to teleport, but would then immediately
issue another more a refined search within this site.

For example, if someone is looking for official information about
the id="ftnz" >Hubble Space Telescope on the NASA website, one might
first search for [NASA] and then [NASA Hubble Telescope], like
this:

onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"
href="http://bp2.blogger.com/_Ap14FtNN91w/R883J65W4II/AAAAAAAAA20/CxvglcYMwDQ/s1600-h/search_subset.gif" >
alt="" id="BLOGGER_PHOTO_ID_5174415140440170626"
border="0" />

Through href="http://googleblog.blogspot.com/2006/04/this-is-test-this-is-only-test.html"
id="g-73" >experimentation, we found that presenting users with
a search box as part of the result increases their likelihood of
finding the exact page they are looking for. So over the past few
days we have been testing, and today we have fully rolled out, a
search box that appears within some of the search results
themselves. This feature will now occur when we detect a high
probability that a user wants more refined search results within a
specific site. Like the rest of our snippets, the sites that
display the site search box are chosen algorithmically based on
metrics that measure how useful the search box is to users.

We hope that you will make use of the site search box in order to
get the information you're looking for as quickly and easily as
possible.

Tags: , , , , , , ,

Making search better in Catalonia, Estonia, and everywhere else

Filed under: Official Google Blog — Wrote by Lees on Sunday, May 4th, 2008 @ 2:27 am

Posted by Paul Haahr and Steve Baker,
Software Engineers, Search Quality

We recently began a
series of posts on how we harness the power of data. href="http://googleblog.blogspot.com/2008/03/why-data-matters.html"
id="m2px" >Earlier we told
you how data has
been critical to the advancement of search; about using data to href="http://googleblog.blogspot.com/2008/03/using-log-data-to-help-keep-you-safe.html"
id="cxjy" >make our
products safe and to href="http://googleblog.blogspot.com/2008/03/using-data-to-help-prevent-fraud.html"
id="thga" >prevent fraud;
this post is the newest in
the series. -Ed.

One of the most important uses of data at Google is building
language models. By analyzing how people use language, we build
models that enable us to interpret searches better, offer spelling
corrections, understand when alternative forms of words are needed,
offer >language
translation
, and even href="http://googleblog.blogspot.com/2007/05/search-without-boundaries.html"
id="e-d4"
>suggest
when searching in another language is appropriate.

One place we use these models is to find alternatives for words
used in searches. For example, for both English and French users,
"GM" often means the company "General Motors,"
but our language model understands that in French searches like href="http://www.google.fr/search?hl=fr&q=seconde GM" id="myy8"
>seconde GM, it means "Guerre
Mondiale" (World War), whereas in href="http://www.google.fr/search?hl=fr&q=STI GM" id="yz1y"
>STI GM it means "Génie Mécanique"
(Mechanical Engineering). Another meaning in English is
"genetically modified," which our language model
understands in href="http://www.google.com/search?q=GM corn&hl=en" id="rf9x"
>GM corn. We've learned this based on the
documents we've seen on the web and by observing that users
will use both "genetically modified" and "GM"
in the same set of searches.

We use similar techniques in all languages. For example, if a
Catalan user searches for href="http://www.google.es/search?hl=ca&q=resultat elecció barris BCN"
id="w3e2" >resultat elecció barris BCN
(searching for the result of a neighborhood election in Barcelona),
Google will also find pages that use the words
"resultats" or "eleccions" or that talk about
"Barcelona" instead of "BCN." And our language
models also tell us that the Estonian user looking for href="http://www.google.ee/search?hl=et&q=Tartu juuksur"
id="b8_i" >Tartu juuksur, a barber in
Tartu, might also be interested in a "juuksurisalong," or
"barber shop."

In the past, language models were built from dictionaries by hand.
But such systems are incomplete and don't reflect how people
actually use language. Because our language models are based on
users' interactions with Google, they are more precise and
comprehensive — for example, they incorporate names, idioms,
colloquial usage, and newly coined words not often found in
dictionaries.

When building our models, we use billions of web documents and as
much historical search data as we can, in order to have the most
comprehensive understanding of language possible. We analyze how
our users searched and how they revised their searches. By looking
across the aggregated searches of many users, we can infer the
relationships of words to each other.

Queries are not made in isolation — analyzing a single search in
the context of the searches before and after it helps us understand
a searcher's intent and make inferences. Also, by analyzing how
users modify their searches, we've learned related words,
variant grammatical forms, spelling corrections, and the concepts
behind users' information needs. (We're able to make these
connections between searches using cookie IDs — small pieces of
data stored in visitors' browsers that allow us to distinguish
different users. To understand how cookies work, href="http://youtube.com/watch?v=XfZLztx8cKI"
id="e0o:" >watch this video.)

To provide more relevant search results, Google is constantly
developing new techniques for language modeling and building better
models. One element in building better language models is href="http://googleblog.blogspot.com/2005/08/machines-do-translating.html"
id="z0.8" >using more data collected over
longer periods of time. In languages with many documents and users,
such as English, our language models allow us to improve results
deep into the "long tail" of searches, learning about
rare usages. However, for languages with fewer users and fewer
documents on the web, building language models can be a challenge.
For those languages we need to work with longer periods of data to
build our models. For example, it takes more than a year of
searches in Catalan to provide a comparable amount of data as a
single day of searching in English; for Estonian, more than two and
a half years worth of searching is needed to match a day of
English. Having longer periods of data enables us to improve search
for these less commonly used languages.

At Google, we want to ensure that we can help users everywhere find
the things they're looking for; providing accurate, relevant
results for searches in all languages worldwide is core to
Google's mission. Building extensive models of historical usage
in every language we can, especially when there are few users, is
an essential piece of making search work for everyone, everywhere.

Tags: , , , , , , ,

Offline access to Google Docs

Filed under: Official Google Blog — Wrote by Lees on Sunday, May 4th, 2008 @ 2:26 am

Posted by Janani Ravi, Software
Engineer

Our team has a real affinity for free-spirited types, and so we
spend a lot of time thinking up ways to make href="http://docs.google.com/" id="xtf5" >Google Docs friendlier
even to people on the go. If you're one of those, you already
know how you can access your Google Docs from anywhere, how nice it
is to avoid having to email yourself files or back up docs with a
thumbdrive, and how easily you can collaborate with others.

Of course there was a
teeny thing missing: you needed an Internet connection to make
Google Docs work for you. Now, for documents, that's no longer
true. As you'll read on the href="http://googledocs.blogspot.com/" id="f98e"
>Google Docs blog, starting today and
over the coming weeks we're rolling out offline editing access
to word processing documents to Google Docs users. You no longer
need an Internet connection when inspiration strikes. Whether
you're working on an airplane or in a cafe, you can
automatically access all your docs on your own computer.

To see how offline access works, watch this video:

value="http://www.youtube.com/v/7cyHYEfpRVA&hl=en" /> type="application/x-shockwave-flash" wmode="transparent"
height="355" width="425" />

Tags: , , , , , , ,

Last call for scalability papers

Filed under: Official Google Blog — Wrote by Lees on Sunday, May 4th, 2008 @ 2:25 am

Posted by Andrew Schwerin, Software
Engineer

As an engineer in Seattle, I can't wait for summer to arrive.
This year, I'm not just looking forward to the beautiful
weather, but also the href="http://googleblog.blogspot.com/2008/02/seattle-conference-on-scalability-2008.html"
id="xbg0" >Seattle
Scalability Conference. It takes place on Saturday, June
14th.

Conference planning is underway, and the deadline for our call for
papers is this Friday, April 11th! If you have an interesting
approach for building and maintaining scalable systems, this is the
perfect gathering to share it. Send a 500-word abstract of your
30-minute presentation to scalabilityconf at google.com — and plan
on enjoying the long-awaited Seattle sunshine with me this
June.

Here are some of the topics that interest me (but I'm open to
more ideas):

Scalable algorithms:

Parallelization techniques (fully automatic or
programmer-assisted)

Algorithms that are robust in the face of system failures
(flaky hardware, OS bugs, network failures)

Scalable systems:

Managing large, evolving data stores

Languages and tools for organizing high-throughput data
processing systems

Handling partial failure (automatic failure
detection/diagnosis/repair)

We'll be posting more information for conference attendees
soon.

href="http://feeds.feedburner.com/~f/blogspot/MKuf?a=tK60kOG" >

Tags: , , , , , , ,

Haere Mai! Google Maps lands in New Zealand

Filed under: Official Google Blog — Wrote by Lees on Sunday, May 4th, 2008 @ 2:24 am

Posted by Noel Gordon, Software
Engineer

Today we're pleased to announce the full launch of

href="http://maps.google.co.nz/" >Google Maps in New
Zealand
. Although we've offered basic mapping in New
Zealand for some time, today we're unveiling a localised and
customised site for our users in New Zealand. We've added full
local business search capabilities, plus the href="http://www.google.com/local/add/lookup?hl=en-AU&gl=NZ"
>Local Business Centre, so that any Kiwi
business can get a free listing.

While we know New Zealanders care about a lot more than rugby,
racing and beer, we're pleased that now Google Maps can help
you find the nearest place to do any of those things. Plus, you can
find a

href="http://maps.google.co.nz/maps?f=q&hl=en&geocode=&q=cafe ponsonby&ie=UTF8&z=15&om=1"
>cafe in Ponsonby
, an

href="http://maps.google.co.nz/maps?f=q&hl=en&geocode=&q=indian restaurant wellington&ie=UTF8&z=9&om=1"
>Indian restaurant in Wellington
, a

href="http://maps.google.co.nz/maps?f=q&hl=en&geocode=&q=plumber dunedin&ie=UTF8&z=13&om=1"
>plumber in Dunedin
, or a great

href="http://maps.google.co.nz/maps?f=q&hl=en&geocode=&q=queenstown hotel&ie=UTF8&t=h&z=11&om=1"
>hotel for your next holiday
. All available on
a PC or

href="http://www.google.co.nz/gmm/index.html"
>mobile phone
near you!

And of course, Kiwis can also use Google Maps to look up an
address, get

href="http://maps.google.co.nz/maps?f=q&hl=en&geocode=&q=auckland to rotorua&sll=-45.031104,168.662731&sspn=0.33435,0.6427&ie=UTF8&t=h&z=9&om=1"
>driving directions
(and the ability to drag
them around), browse our satellite imagery, and even create custom
maps that you can share with friends, all using the simple
interface.

Tags: , , , , , , ,

Better flight stats results

Filed under: Official Google Blog — Wrote by Lees on Sunday, May 4th, 2008 @ 2:22 am

Posted by Nick Weininger, Software
Engineer

The holiday season is upon us, and we know that
many amongst you are traveling to be with family and friends. To
make sure you spend more time celebrating and less time waiting at
the airport, we're introducing a new flight status feature that
we hope will make your travels easier.

For the latest information on a flight's status, simply search
for an airline and flight number, and the first result will tell
you whether your flight is on time or delayed as well as the
estimated departure and arrival times. Here's a quick example
for a specific American Airlines flight:

onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"
href="http://bp1.blogger.com/_Ap14FtNN91w/R2cVrILM1dI/AAAAAAAAAgs/X6NK8FiA9q4/s1600-h/Flightstats.gif" >
alt="" id="BLOGGER_PHOTO_ID_5145104929967101394"
border="0" />

Hopefully this makes your makes your travels a little smoother.
Happy holidays from the Google Search team.

P.S. We're still working on building a search feature that will
find your lost luggage ;-)

Tags: , , , , , , ,

New Toolbar adds accessible features

Filed under: Official Google Blog — Wrote by Lees on Saturday, May 3rd, 2008 @ 10:16 pm

Posted by Jonas Klink, Software
Engineer, Accessibility

Last week Google Toolbar for Internet Explorer href="http://www.google.com/intl/en/press/annc/20071212_toolbar.html" >
launched version 5 as a public beta. This version introduces a
number of exciting features, such as making your Toolbar settings
available from any computer that you log into with your Google
Account, improved suggestions for broken links, as well as
important changes that make Toolbar more accessible for assistive
technology users.

This release adds support for Windows Accessibility APIs (used by
screen readers, etc.) and enables keyboard navigation and access.
From inside a browser with Toolbar installed, the global shortcut
Alt G places your cursor in the Google Toolbar search box. If
you're using a screen reader, you'll hear "Google
Toolbar Search". Pressing the Tab key brings keyboard focus to
the button placed immediately after the search box, and right and
left arrow keys move focus between buttons. More information on
keyboard access is documented in the href="http://www.google.com/support/toolbar/bin/search.py?query=accessibility&Action.Search=Search&ctx=en:searchbox" >
Toolbar Help Center (query 'accessibility').

Version 5 comes as a part of our ongoing efforts to enhance
accessibility in our client-side and web applications, which is a
matter I hardly need to mention is very important. Personally, I
see my work that went into the Toolbar as an important step
forward, as the product reaches a very large number of users and
enables everyone to gain quick access to a multitude of useful
features, through a unified UI. Adding keyboard navigation and
other features that enhance the ease of access to these features
benefit everyone.

We look forward to making further improvements to accessibility
(including the installation process) in future releases. You can
download the new Google Toolbar at href="http://toolbar.google.com/T5" >http://toolbar.google.com/T5.

Tags: , , , , , , ,

New Toolbar adds accessible features

Filed under: Official Google Blog — Wrote by Lees on Sunday, April 27th, 2008 @ 1:37 am

Posted by Jonas Klink, Software
Engineer, Accessibility

Last week Google Toolbar for Internet Explorer href="http://www.google.com/intl/en/press/annc/20071212_toolbar.html" >
launched version 5 as a public beta. This version introduces a
number of exciting features, such as making your Toolbar settings
available from any computer that you log into with your Google
Account, improved suggestions for broken links, as well as
important changes that make Toolbar more accessible for assistive
technology users.

This release adds support for Windows Accessibility APIs (used by
screen readers, etc.) and enables keyboard navigation and access.
From inside a browser with Toolbar installed, the global shortcut
Alt G places your cursor in the Google Toolbar search box. If
you're using a screen reader, you'll hear "Google
Toolbar Search". Pressing the Tab key brings keyboard focus to
the button placed immediately after the search box, and right and
left arrow keys move focus between buttons. More information on
keyboard access is documented in the href="http://www.google.com/support/toolbar/bin/search.py?query=accessibility&Action.Search=Search&ctx=en:searchbox" >
Toolbar Help Center (query 'accessibility').

Version 5 comes as a part of our ongoing efforts to enhance
accessibility in our client-side and web applications, which is a
matter I hardly need to mention is very important. Personally, I
see my work that went into the Toolbar as an important step
forward, as the product reaches a very large number of users and
enables everyone to gain quick access to a multitude of useful
features, through a unified UI. Adding keyboard navigation and
other features that enhance the ease of access to these features
benefit everyone.

We look forward to making further improvements to accessibility
(including the installation process) in future releases. You can
download the new Google Toolbar at href="http://toolbar.google.com/T5" >http://toolbar.google.com/T5.

Tags: , , , , , , ,

Copyright © 2007 Google Adsense College.
Powered by GoogleSchool. All Rights Reserved.