AOL has published this week 36 millions search queries from more than 650,000 US users. The data has been anonymized, but you can still use the queries to find information about each person. The data, released by AOL Labs as a research material, stirred a lot of negative controversy, and it was removed from AOL's site. AOL spokesman Andrew Weinstein confessed: "This was a screw-up, and we're angry and upset about it. It was an innocent enough attempt to reach out to the academic community with new research tools, but it was obviously not appropriately vetted, and if it had been, it would have been stopped in an instant."
The valuable data is still available on other sites, including this searchable database and this mirror (439 MB, tgz file). You can find users like 1983280 and track all the searches between March 1st and May 31st this year. The data set includes these fields: UserID, Query, Query Time, Clicked Rank, Destination Domain. So what can you find out about our user? She's a teenager interested in politics, she's from Washington DC, she likes photography and American Idol, one of her parents died and she's about to get married. From other users, you can find the name, the address, the work place and other details that allows identifying the person. New York Times discovered the user no. 4417749, Thelma Arnold, a 62-year-old widow who lives in Lilburn. AOL didn't realize that, in the name of the science, has comitted the biggest privacy breach a search engine ever did. Google didn't let the Government to obtain a similar data set, and AOL, who gets the search results from Google, releases them to the public.
Despite all the privacy considerations, the database is fascinating and it could be the subject of a book about human nature.
What happens when your life is exposed to the public by small fragments of text? You reveal your intentions, your problems and fears, your friendships and your hidden desires. Your queries reveal more than any detective or psychiatrist could find about your life.
Labels
Web Search
Gmail
Google Docs
Mobile
YouTube
Google Maps
Google Chrome
User interface
Tips
iGoogle
Social
Google Reader
Traffic Making Devices
cpp programming
Ads
Image Search
Google Calendar
tips dan trik
Google Video
Google Translate
web programming
Picasa Web Albums
Blogger
Google News
Google Earth
Yahoo
Android
Google Talk
Google Plus
Greasemonkey
Security
software download
info
Firefox extensions
Google Toolbar
Software
OneBox
Google Apps
Google Suggest
SEO Traffic tips
Book Search
API
Acquisitions
InOut
Visualization
Web Design Method for Getting Ultimate Traffic
Webmasters
Google Desktop
How to Blogging
Music
Nostalgia
orkut
Google Chrome OS
Google Contacts
Google Notebook
SQL programming
Google Local
Make Money
Windows Live
GDrive
Google Gears
April Fools Day
Google Analytics
Google Co-op
visual basic
Knowledge
java programming
Google Checkout
Google Instant
Google Bookmarks
Google Phone
Google Trends
Web History
mp3 download
Easter Egg
Google Profiles
Blog Search
Google Buzz
Google Services
Site Map for Ur Site
game download
games trick
Google Pack
Spam
cerita hidup
Picasa
Product's Marketing
Universal Search
FeedBurner
Google Groups
Month in review
Twitter Traffic
AJAX Search
Google Dictionary
Google Sites
Google Update
Page Creator
Game
Google Finance
Google Goggles
Google Music
file download
Annoyances
Froogle
Google Base
Google Latitude
Google Voice
Google Wave
Google Health
Google Scholar
PlusBox
SearchMash
teknologi unik
video download
windows
Facebook Traffic
Social Media Marketing
Yahoo Pipes
Google Play
Google Promos
Google TV
SketchUp
WEB Domain
WWW World Wide Service
chord
Improve Adsence Earning
jurnalistik
sistem operasi
AdWords Traffic
App Designing
Tips and Tricks
WEB Hosting
linux
How to Get Hosting
Linux Kernel
WEB Errors
Writing Content
award
business communication
ubuntu
unik