Conservation DistList Archives [Date] [Subject] [Author] [SEARCH]

Subject: Google Books Ngram Viewer

Google Books Ngram Viewer

From: Lisa Mibach <lm<-at->
Date: Sunday, December 19, 2010
Google has launched Google Books Ngram Viewer, a data visualization
tool that crawls 500 billion words culled from 5.2 million books
published between 1500 and 2008 that Google has indexed in its cloud
computing system.

One example shows how the tool can compare instances of musical
instruments in English literature from 1750 to 2008. Note how the
drum and trumpet, in particular, seemed to trade places in
popularity over the last two hundred years.
    <URL:http://googleblog.blogspot.com/
      2010/12/find-out-whats-in-word-or five-with.html>

Google said it is also making the datasets supporting the Ngram
Viewer freely downloadable so that scholars can replicate the work.
The datasets were used in a research project led by Harvard
University's Jean-Baptiste Michel and Erez Lieberman Aiden, along
with several Googlers, said Jon Orwant, Google Books engineering
manager.

   "Their work provides several examples of how quantitative methods
    can provide insights into topics as diverse as the spread of
    innovations, the effects of youth and profession on fame, and
    trends in censorship," Orwant said.

Users may access the tool here

    <URL:http://ngrams.googlelabs.com>

and type up to five words to see a typical Google graph that counts
the use of words and phrases each year over the last several hundred
years. The words come from books published in Chinese, English,
French, German, Russian and Spanish.

This assumes that that the source has been indexed in Google, which
may not be the case for some of our early classics, such as Stolow,
Feller, Jones "On Picture Varnishes and Their Solvents" or Giorgio
Torraca's "Properties of Synthetic Resins" which contain information
that is still valuable. This hints at a potential project that might
be undertaken by one or more of our conservation institutions,
perhaps as a web-based collaborative effort such as a wiki. See
<URL:http://en.wikipedia.org/wiki/Pbworks> for a useful tool

Potential uses of Ngrams in conservation include:

    A search for uses of materials such as soluble nylon, methyl
    methyl methacrylate, and poly n-butyl methacrylate to examine
    the rise and fall of their uses;

    The prevalence of techniques such as canvas-backing or strappo;
    The user is then left to consider the reasons for these trends,
    be they personal loyalties, research and publications, or
    geographical communications ....

    Variations in uses of particular types of artifacts such as
    harpsichords

Articles describing Ngram:

  <URL:http://www.eweek.com/c/a/Search-Engines/
  Google-Ngram-Viewer-Gauges-Word-Popularity-Over-Centuries-179056>

  <URL:http://googleblog.blogspot.com/2010/12/
  find-out-whats-in-word-or-five-with.html>

The New York Times

    http://www.nytimes.com/2010/12/17/books/17words.html

Wall Street Journal

  <URL:http://online.wsj.com/article/
     SB10001424052748704073804576023741849922006.html>

  <URL:http://www.readwriteweb.com/
     archives/new_visualization_tool_from_google_with_data_from.php>

     <URL:http://www.readwriteweb.com/archives/
       10_fascinating_word_graphs_from_200_years_of_googl.php>

    **** Moderator's comments: The above URLs have been wrapped for
    email. There should be no newlines.

Wishing you all happy searches over the holidays,

Lisa Mibach
11 rue Forgie
Gatineau QC
J9H 1W7 Canada
819-685-2323
Mobile: 613-355-3278


                                  ***
                  Conservation DistList Instance 24:31
                 Distributed: Monday, December 20, 2010
                       Message Id: cdl-24-31-007
                                  ***
Received on Sunday, 19 December, 2010

[Search all CoOL documents]