From the TED Talk "Brewster Kahle: A free digital library"

Unscramble the Blue Letters

But we sent — thought, OK, if we're going to need to do this, let's do it in-library. How do we go and do this, and how do we get it down so that it's a cost point that we could ffroda? And we sort of picked the price point of 10 cents a page. If it's basically the cost of xeroxing to basically diztigie, OCR, package it up, make it so that you could download, print and bind it — the whole shebang — we would have achieved something. So we adrtets out trying to figure out. How do we get to 10 cents? And we tried these robot things, and they worked pretty well — sort of these auto-page-turning things. If we can have Mars ovrsre, you'd think you could turn pages. But it actually turns out to be pretty hard to turn pages, and the volume isn't there. So anyway — so we ended up making our own book asenrcn, and with two digital, high-grade, professional digital cameras, cotdlelnro museum lighting, so even if it's a black and white book, you can go and get the proper intonation. So you basically do a beautiful, respectful job. This is not a fax, this is — the idea is to do a bftuiueal job as you're going through these libraries. And we've been able to achieve 10 cents a page if we run things in volume. This is what it looks like at the tvsneiruyi of rnootto. And actually, it turns out to, you know, pay a ivnlgi wage. People seem to love it. Yes, it's a little igrobn, but some people knid of get into the Zen of it. (Laughter) And especially if it's kind of interesting books that you care about, in languages that you can ared. We actually have been able to do a ytertp good job of this, at getting 10 cents a pgea. So 10 cents a page, 300 pages in your average book, 30 loasdlr a book. The Library of Congress, if you did the whole darn thing — 26 illimno oksob — is about 750 million dollars, right? But a million books, I think, actually would be a pretty good start, and that would cost 30 million dollars. That's not that big a bill.

Open Cloze

But we sent — thought, OK, if we're going to need to do this, let's do it in-library. How do we go and do this, and how do we get it down so that it's a cost point that we could ______? And we sort of picked the price point of 10 cents a page. If it's basically the cost of xeroxing to basically ________, OCR, package it up, make it so that you could download, print and bind it — the whole shebang — we would have achieved something. So we _______ out trying to figure out. How do we get to 10 cents? And we tried these robot things, and they worked pretty well — sort of these auto-page-turning things. If we can have Mars ______, you'd think you could turn pages. But it actually turns out to be pretty hard to turn pages, and the volume isn't there. So anyway — so we ended up making our own book _______, and with two digital, high-grade, professional digital cameras, __________ museum lighting, so even if it's a black and white book, you can go and get the proper intonation. So you basically do a beautiful, respectful job. This is not a fax, this is — the idea is to do a _________ job as you're going through these libraries. And we've been able to achieve 10 cents a page if we run things in volume. This is what it looks like at the __________ of _______. And actually, it turns out to, you know, pay a ______ wage. People seem to love it. Yes, it's a little ______, but some people ____ of get into the Zen of it. (Laughter) And especially if it's kind of interesting books that you care about, in languages that you can ____. We actually have been able to do a ______ good job of this, at getting 10 cents a ____. So 10 cents a page, 300 pages in your average book, 30 _______ a book. The Library of Congress, if you did the whole darn thing — 26 _______ _____ — is about 750 million dollars, right? But a million books, I think, actually would be a pretty good start, and that would cost 30 million dollars. That's not that big a bill.

Solution

  1. toronto
  2. living
  3. kind
  4. controlled
  5. university
  6. digitize
  7. afford
  8. boring
  9. dollars
  10. million
  11. started
  12. read
  13. beautiful
  14. books
  15. pretty
  16. rovers
  17. page
  18. scanner

Original Text

But we sent — thought, OK, if we're going to need to do this, let's do it in-library. How do we go and do this, and how do we get it down so that it's a cost point that we could afford? And we sort of picked the price point of 10 cents a page. If it's basically the cost of xeroxing to basically digitize, OCR, package it up, make it so that you could download, print and bind it — the whole shebang — we would have achieved something. So we started out trying to figure out. How do we get to 10 cents? And we tried these robot things, and they worked pretty well — sort of these auto-page-turning things. If we can have Mars Rovers, you'd think you could turn pages. But it actually turns out to be pretty hard to turn pages, and the volume isn't there. So anyway — so we ended up making our own book scanner, and with two digital, high-grade, professional digital cameras, controlled museum lighting, so even if it's a black and white book, you can go and get the proper intonation. So you basically do a beautiful, respectful job. This is not a fax, this is — the idea is to do a beautiful job as you're going through these libraries. And we've been able to achieve 10 cents a page if we run things in volume. This is what it looks like at the University of Toronto. And actually, it turns out to, you know, pay a living wage. People seem to love it. Yes, it's a little boring, but some people kind of get into the Zen of it. (Laughter) And especially if it's kind of interesting books that you care about, in languages that you can read. We actually have been able to do a pretty good job of this, at getting 10 cents a page. So 10 cents a page, 300 pages in your average book, 30 dollars a book. The Library of Congress, if you did the whole darn thing — 26 million books — is about 750 million dollars, right? But a million books, I think, actually would be a pretty good start, and that would cost 30 million dollars. That's not that big a bill.
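
The cost figures in the passage can be checked with a few lines of arithmetic. The sketch below takes the two numbers quoted in the talk (10 cents a page, 300 pages in an average book) and multiplies them out; the constant and function names are illustrative assumptions, not anything from the talk. The exact product for 26 million books comes to 780 million dollars, which the speaker rounds to "about 750 million".

```python
# Figures quoted in the talk; the names below are illustrative only.
CENTS_PER_PAGE = 10    # target price point per scanned page
PAGES_PER_BOOK = 300   # "300 pages in your average book"


def cost_in_dollars(books: int) -> int:
    """Return the scanning cost in whole dollars for a number of books."""
    total_cents = books * PAGES_PER_BOOK * CENTS_PER_PAGE
    return total_cents // 100


for label, books in [
    ("one book", 1),
    ("a million books", 1_000_000),
    ("26 million books", 26_000_000),
]:
    print(f"{label}: ${cost_in_dollars(books):,}")
```

Working in integer cents keeps the totals exact and avoids floating-point rounding on the per-page price.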

ngrams of length 2

collocation      frequency
send books       3
large scale      3
download print   3

Important Words

  1. achieve
  2. achieved
  3. afford
  4. average
  5. basically
  6. beautiful
  7. big
  8. bill
  9. bind
  10. black
  11. book
  12. books
  13. boring
  14. cameras
  15. care
  16. cents
  17. congress
  18. controlled
  19. cost
  20. darn
  21. digital
  22. digitize
  23. dollars
  24. download
  25. ended
  26. fax
  27. figure
  28. good
  29. hard
  30. idea
  31. interesting
  32. intonation
  33. job
  34. kind
  35. languages
  36. laughter
  37. libraries
  38. library
  39. lighting
  40. living
  41. love
  42. making
  43. mars
  44. million
  45. museum
  46. ocr
  47. package
  48. page
  49. pages
  50. pay
  51. people
  52. picked
  53. point
  54. pretty
  55. price
  56. print
  57. professional
  58. proper
  59. read
  60. respectful
  61. robot
  62. rovers
  63. run
  64. scanner
  65. shebang
  66. sort
  67. start
  68. started
  69. thought
  70. toronto
  71. turn
  72. turns
  73. university
  74. volume
  75. wage
  76. white
  77. worked
  78. xeroxing
  79. zen