Moving to Unicode 5.1

Filed under: Official Google Blog — Wrote by Lees on Tuesday, May 13th, 2008 @ 10:04 pm

Posted by Mark Davis, Senior
International Software Architect

Google has just begun supporting href="http://www.unicode.org/press/pr-5.1.html" id="yh5d" >Unicode
5.1, less than one month after it was released. It's now
available in search, so people speaking languages such as Malayalam
can now search for words containing the new characters in Unicode
5.1.

Web pages can use a variety of different character encodings, like
ASCII, Latin-1, or Windows 1252, or href="http://www.unicode.org/book/aboutbook.html#Foreword"
id="c42w" >Unicode. Most encodings can only represent a few
languages, but Unicode will handle anything from Chinese to French
to Arabic. We have long used Unicode as the internal format for all
the text we search: any other encoding is first converted to
Unicode for processing. So we regularly update to each new version
of Unicode (and relevant related standards like href="http://unicode.org/cldr/" id="az_w" >CLDR and href="http://www.rfc-editor.org/rfc/bcp/bcp47.txt"
id="da92" >BCP 47) to make sure we are current. Thus Unicode
plays a key role in our href="http://www.google.com/corporate/" >mission.

Uptick in native Unicode webpages

Just last December there was an interesting
milestone on the web. For the first time, we found that Unicode was
the most frequent encoding found on web pages, overtaking both
ASCII and Western European encodings—and by coincidence, within 10
days of one another. What's more impressive than simply
overtaking them is the speed with which this happened; take a look
at the blue line in this graph.

onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"
href="http://bp1.blogger.com/_Ap14FtNN91w/SBzrtHJfLnI/AAAAAAAAA5U/TV7_g2_sWq0/s1600-h/Unicode2.gif" >
alt="" id="BLOGGER_PHOTO_ID_5196287230324190834"
border="0" />

You can see a long-term decline in pages encoded in
ASCII (unaccented letters A through Z). More recently, there's
been a significant drop in the use of encodings covering only
Western European letters (ASCII and a few accented letters like Ä,
Ç, and Ø). We're seeing similar declines in other
language-specific encodings. Unicode, on the other hand, is showing
a sharp increase in usage.

This is based on our indexing of web pages, and thus may vary
somewhat from what other search engines find. However, the trends
are pretty clear, and the continued rise in use of Unicode makes it
even easier to do the processing for the many languages that we
cover.

href="http://feeds.feedburner.com/~f/blogspot/MKuf?a=QxeygH" > border="0" />

height="1" width="1" />

Tags: , , , , , , ,

Parcelforce International Hub, Coventry

Filed under: Google Sightseeing — Wrote by Lees on Wednesday, December 19th, 2007 @ 2:03 am

This is the centre of Parcelforce Worldwide’s UK operations,
their
International Hub beside Coventry Airport, England.

Why would I link to the Parcelforce International hub? Well,
apart from being http://en.wikipedia.org/wiki/Parcelforce one of the largest
buildings in the UK, as I write this post, this is the current
location of my brand new http://wii.com/ Nintendo
Wii!

>

So good luck to all of you who’ve ordered a Wii from Amazon, as
yours is probably in this warehouse too… Here’s hoping you’ll all
get them first thing tomorrow! Worringly, in my case it does seem
to have http://maps.google.co.uk/maps?f=d&hl=en&saddr=CV3 4PF&daddr=EH8&sll=52.362943,-1.475987&sspn=0.012147,0.027938&ie=UTF8&z=7&t=h&om=1
a rather long way to come… src='/wp-content/uploads/2007/200712191732073477802.gif'
alt=':-(' class='wp-smiley'>

Tags: , , , ,

YouTube in 9 more domains

Filed under: Official Google Blog — Wrote by Lees on Tuesday, December 11th, 2007 @ 6:18 am

Posted by Sakina Arsiwala,
International Manager, YouTube

The YouTube community began life speaking English, but thanks to
the uniquely expressive medium of video, today there is a global
village of content makers and viewers. With a noteworthy number of
YouTube visitors now coming from outside the U.S., it's high
time we go multilingual.

Today at a Google press event in Paris, Chad Hurley and Steve Chen
are announcing the launch of nine new domains in href="http://www.youtube.com.br/" >Brazil, href="http://www.youtube.fr/" >France, href="http://it.youtube.com/" >Italy, href="http://www.youtube.jp/" >Japan, href="http://www.youtube.nl/" >the Netherlands, href="http://www.youtube.pl/" >Poland, href="http://www.youtube.es/" >Spain, href="http://www.youtube.ie/" >Ireland, and the href="http://www.youtube.co.uk/" >UK.

In response to many requests, each new site is fully translated and
localized for each country including content (Featured Videos,
Director Videos, Promotions), as well as the interfaces, search,
user support, and such community features as video ratings,
sharing, and content flagging. And these new localized versions are
built using Google search technology, so you can quickly find more
of what you want to see. Perhaps best of all, you can continue to
use youtube.com, or move to
one of these localized sites — and switch seamlessly between the
two. Happy creating, viewing and sharing!

Update:
Removed link to user content.

Tags: , , , , , , ,

Copyright © 2007 Google Adsense College.
Powered by GoogleSchool. All Rights Reserved.