Quantitative Analysis of Culture Using Millions of Digitized Books

被引:1492
作者
Michel, Jean-Baptiste [1 ,2 ,3 ,4 ,5 ]
Shen, Yuan Kui [2 ,6 ,7 ]
Aiden, Aviva Presser [2 ,6 ]
Veres, Adrian [2 ,6 ,8 ]
Gray, Matthew K. [9 ]
Pickett, Joseph P. [10 ]
Hoiberg, Dale [11 ]
Clancy, Dan [9 ]
Norvig, Peter [9 ]
Orwant, Jon [9 ]
Pinker, Steven [5 ]
Nowak, Martin A. [1 ,12 ,13 ]
Aiden, Erez Lieberman [1 ,2 ,6 ,13 ,14 ,15 ,16 ]
机构
[1] Harvard Univ, Program Evolutionary Dynam, Cambridge, MA 02138 USA
[2] Harvard Univ, Cultural Observ, Cambridge, MA 02138 USA
[3] Harvard Univ, Inst Quantitat Social Sci, Cambridge, MA 02138 USA
[4] Harvard Univ, Dept Psychol, Cambridge, MA 02138 USA
[5] Harvard Univ, Sch Med, Dept Syst Biol, Boston, MA 02115 USA
[6] Harvard Univ, Lab Large, Cambridge, MA 02138 USA
[7] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[8] Harvard Univ, Cambridge, MA 02138 USA
[9] Google, Mountain View, CA 94043 USA
[10] Houghton Mifflin Harcourt, Boston, MA 02116 USA
[11] Encyclopaedia Britannica, Chicago, IL 60654 USA
[12] Harvard Univ, Dept Organism & Evolutionary Biol, Cambridge, MA 02138 USA
[13] Harvard Univ, Dept Math, Cambridge, MA 02138 USA
[14] Harvard Univ, Broad Inst Harvard & MIT, Cambridge, MA 02138 USA
[15] Harvard Univ, Sch Engn & Appl Sci, Cambridge, MA 02138 USA
[16] Harvard Univ, Harvard Soc Fellows, Cambridge, MA 02138 USA
关键词
D O I
10.1126/science.1199644
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We constructed a corpus of digitized texts containing about 4% of all books ever printed. Analysis of this corpus enables us to investigate cultural trends quantitatively. We survey the vast terrain of 'culturomics,' focusing on linguistic and cultural phenomena that were reflected in the English language between 1800 and 2000. We show how this approach can provide insights about fields as diverse as lexicography, the evolution of grammar, collective memory, the adoption of technology, the pursuit of fame, censorship, and historical epidemiology. Culturomics extends the boundaries of rigorous quantitative inquiry to a wide array of new phenomena spanning the social sciences and the humanities.
引用
收藏
页码:176 / 182
页数:7
相关论文
共 28 条
  • [1] Algeo J., 1991, 50 YEARS NEW WORDS D
  • [2] [Anonymous], 1999, WORDS RULES
  • [3] [Anonymous], 1997, FRENZY RENOWN FAME I
  • [4] [Anonymous], 1935, The Psychobiology of Language
  • [5] [Anonymous], 1981, Cultural transmission and evolution: A quantitative approach
  • [6] Barron Stephanie., 1991, Degenerate Art: The Fate of the Avant-Garde in Nazi Germany
  • [7] Barry JohnM., 2004, THE GREAT INFLUENZA
  • [8] From usage to grammar: The mind's response to repetition
    Bybee, Joan
    [J]. LANGUAGE, 2006, 82 (04) : 711 - 733
  • [9] Ebbinghaus H., 1987, MEMORY CONTRIBUTION
  • [10] Gove P.B., 1993, WEBSTERS 3 NEW INT D