
Why Google keeps your data forever, tracks you with ads

Author: unknown   Published: 2010-03-12 09:45:40   Source: web

  Not many companies could get away with defending controversial data retention practices by saying that the data is needed to "learn from good guys, fight off bad guys, [and] invent the future." But that's how Google sees itself and its practices—not surprising from a company that would give itself an unofficial motto like "don't be evil."
  I had the chance recently to sit down with two of Google's top privacy people: deputy general counsel Nicole Wong and security/privacy engineer Alma Whitten. While the "good guy/bad guy" and "don't be evil" quotes may seem too cute by half to some, Wong and Whitten made a strong pitch for the truth of both slogans. In their view, Google really is fighting the good fight when it comes to your online privacy.
  Anonymization and its discontents
  Google logs an astonishing amount of data, including the search logs from its flagship product. It keeps this data indefinitely, so searching for a combination of your wife's name and your address and "rat poison in her cereal" is not a particularly smart idea (though search users do this sort of thing anyway).
  But the company does "anonymize" this data eventually. The last octet of the IP address is wiped after nine months, which means there are 254 possibilities for the IP address in question (.0 and .255 are reserved addresses). After 18 months, Google anonymizes the unique cookie data stored in these logs.
  This isn't especially ambitious; Europe's data protection supervisors have called for IP anonymization after six months and competing search engines like Bing do just that (and Bing removes the entire IP address, not just the last octet). Yahoo scrubs its data after 90 days.
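  To make the last-octet wipe concrete, here is a minimal Python sketch. It is not Google's implementation; the function names and the sample address (taken from a range reserved for documentation) are assumptions made for illustration. It zeroes the final octet of an IPv4 address and then counts the addresses the wiped entry could have come from, treating the block as a /24 network and skipping .0 and .255 as described above.

```python
import ipaddress


def wipe_last_octet(ip: str) -> str:
    """Zero the final octet of an IPv4 address (hypothetical helper)."""
    addr = ipaddress.IPv4Address(ip)
    masked = int(addr) & 0xFFFFFF00  # keep only the first three octets
    return str(ipaddress.IPv4Address(masked))


def candidate_hosts(wiped: str) -> list[str]:
    """List the addresses a wiped entry could map back to.

    Assumes the wiped address is the base of a /24 block; hosts()
    leaves out the .0 and .255 addresses.
    """
    network = ipaddress.IPv4Network(f"{wiped}/24")
    return [str(host) for host in network.hosts()]


wiped = wipe_last_octet("203.0.113.77")
print(wiped)                        # 203.0.113.0
print(len(candidate_hosts(wiped)))  # 254
```

  With only 254 candidates left, a wiped address still narrows a log entry down considerably, which is why removing the entire address is the stronger measure.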
  But Whitten, who was involved in Google's decisions on such issues, said that Google has done the best it can to keep the retention period to a minimum while still extracting maximum value from that data... and that this "value" isn't just to Google but also to users.
  "Wonderful things that can be done with an abundance of data," she said. When Google's teams began looking at the data retention issue a few years back, they "started with zero" and tried to see if they could make it work. They could not; Google would lose the ability to do too many useful things.
  Search data is mined to "learn from the good guys," in Google's parlance, by watching how users correct their own spelling mistakes, how they write in their native language, and what sites they visit after searches. That information has been crucial to Google's famously algorithm-driven approach to problems like spell check, machine language translation, and improving its main search engine. Without the algorithms, Google Translate wouldn't be able to support less-used languages like Catalan and Welsh.
  Data is also mined to watch how the "bad guys" run link farms and other Web irritants so that Google can take countermeasures.
  Google eventually settled on anonymizing the IP address after nine months, though even here, "we believe that we have lost the ability to do things," said Whitten.
  Web users don't mind being tracked?
  Instead of cutting the data retention period further, Google is more focused on 1) transparency and 2) keeping the data locked down safely. The company believes that when users know what Google keeps and why it keeps it—and when they have the chance to opt out—users are often happy to let Google do its thing.
  Wong points to behavioral advertising, which Google jumped into last year. This sort of advertising relies on a vast ad network across many sites, and the ads record a visitor's unique cookie. Google can collate this data on the back end and compile a list of interest categories associated with a particular user cookie; since most users never clean their cookies, this works well as a general ad targeting mechanism.
  When Google rolled out the system in March 2009, VP Susan Wojcicki said the things that advertisers always say on such occasions: this is good for consumers.
  "We believe there is real value to seeing ads about the things that interest you," she wrote. "If, for example, you love adventure travel and therefore visit adventure travel sites, Google could show you more ads for activities like hiking trips to Patagonia or African safaris. While interest-based advertising can infer your interest in adventure travel from the websites you visit, you can also choose your favorite categories, or tell us which categories you don't want to see ads for."
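  The cookie-based collation described above can be sketched roughly as follows. This is a toy illustration, not Google's system: the site names, cookie IDs, category mapping, and function names are all hypothetical. It groups ad impressions by cookie and counts the interest categories of the sites where those ads were shown.

```python
from collections import Counter, defaultdict

# Hypothetical mapping from sites in an ad network to interest categories.
SITE_CATEGORIES = {
    "trek-patagonia.example": "adventure travel",
    "safari-tours.example": "adventure travel",
    "camera-reviews.example": "photography",
}


def build_profiles(ad_impressions):
    """Group (cookie_id, site) impressions by cookie and count categories."""
    profiles = defaultdict(Counter)
    for cookie_id, site in ad_impressions:
        category = SITE_CATEGORIES.get(site)
        if category is not None:  # ignore sites with no known category
            profiles[cookie_id][category] += 1
    return profiles


def top_interests(profile, n=2):
    """Return the n most frequently seen categories for one cookie."""
    return [category for category, _ in profile.most_common(n)]


impressions = [
    ("cookie-abc", "trek-patagonia.example"),
    ("cookie-abc", "safari-tours.example"),
    ("cookie-abc", "camera-reviews.example"),
    ("cookie-xyz", "camera-reviews.example"),
]
profiles = build_profiles(impressions)
print(top_interests(profiles["cookie-abc"]))  # ['adventure travel', 'photography']
```

  Because the profile hangs off the cookie rather than a name or account, clearing cookies starts it from scratch, which is why the approach depends on most users never doing so.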
  Choosing your favorite categories, and opting out of behavioral ads altogether, is made possible by Google's Ads Preferences manager. The site gets limited use; despite the hundreds of millions who use Google services or are served Google ads, only "tens of thousands" visit the Ads Preferences site each week, I'm told. One might assume that these would be the most motivated "opt-outers," those who actually understand what behavioral advertising is, know how it works, and hate it with a passion.
  The Google folks insist that this isn't actually what happens when people visit the Ads Preferences page. Compared to the number of people who choose to opt out entirely, four times more people merely edit their categories, while ten times more people do nothing at all.
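  Those ratios translate into rough shares of visitors. The short Python calculation below assumes that the three groups (opt out, edit categories, do nothing) account for everyone who visits the page, which the figures quoted here do not confirm.

```python
from fractions import Fraction

# Relative weights taken from the ratios above: 1 : 4 : 10.
ratios = {"opt out": 1, "edit categories": 4, "do nothing": 10}

total = sum(ratios.values())
shares = {action: Fraction(weight, total) for action, weight in ratios.items()}

for action, share in shares.items():
    print(f"{action}: {float(share):.1%}")
# opt out: 6.7%
# edit categories: 26.7%
# do nothing: 66.7%
```

  Under that assumption, roughly one visitor in fifteen opts out entirely.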
  This could mean several things (are most users just confused about the options and simply do nothing?), but Google takes it as vindication of its willingness to be transparent about what it does, and its willingness to put users in control. Certainly, there are other companies that could take a page from the Google playbook. The Ads Preferences manager makes it simple to opt out with a single click, but this only applies to one browser; Google has also built a browser plugin that can remember the setting across browsers and after cookie purges.
  Given the sheer amount of hate directed at Google-owned DoubleClick that erupted in our recent comment thread on ad blocking, though, it looks like Google still has some way to go before it convinces the geekerati that its opt-out behavioral targeting practices truly aren't "evil."
  As Google services rack up increasing amounts of data on users, the company's strategy for reassuring users is based on such transparency, user control, and data safety. Whitten stresses with pride that Google's data doesn't leak, and Wong notes how aggressively the company pushed back against a broad Department of Justice data request in 2005.
  "We're not holding onto this frivolously," Whitten said. "It's fundamental to bring value to our users."
