From the TED Talk by Riccardo Sabatini: How to read the genome and build a human being

Unscramble the Blue Letters

An amazing team, 40 data scientists and many, many more people, a pulraese to work with. The concept is actually very simple. We're going to use a technology called machine learning. On one side, we have genomes — thousands of them. On the other side, we collected the biggest database of human beings: phenotypes, 3D scan, NMR — everything you can think of. Inside there, on these two opposite sides, there is the secret of translation. And in the middle, we build a machine. We build a machine and we train a machine — well, not exactly one mahcine, many, many machines — to try to understand and translate the gnmoee in a phenotype. What are those letters, and what do they do? It's an approach that can be used for everything, but using it in genomics is particularly complicated. Little by little we grew and we wanted to build different claglehens. We setatrd from the beginning, from common traits. Common tritas are comfortable because they are comomn, everyone has them.

Open Cloze

An amazing team, 40 data scientists and many, many more people, a ________ to work with. The concept is actually very simple. We're going to use a technology called machine learning. On one side, we have genomes — thousands of them. On the other side, we collected the biggest database of human beings: phenotypes, 3D scan, NMR — everything you can think of. Inside there, on these two opposite sides, there is the secret of translation. And in the middle, we build a machine. We build a machine and we train a machine — well, not exactly one _______, many, many machines — to try to understand and translate the ______ in a phenotype. What are those letters, and what do they do? It's an approach that can be used for everything, but using it in genomics is particularly complicated. Little by little we grew and we wanted to build different __________. We _______ from the beginning, from common traits. Common ______ are comfortable because they are ______, everyone has them.

Solution

  1. started
  2. traits
  3. pleasure
  4. genome
  5. machine
  6. challenges
  7. common

Original Text

An amazing team, 40 data scientists and many, many more people, a pleasure to work with. The concept is actually very simple. We're going to use a technology called machine learning. On one side, we have genomes — thousands of them. On the other side, we collected the biggest database of human beings: phenotypes, 3D scan, NMR — everything you can think of. Inside there, on these two opposite sides, there is the secret of translation. And in the middle, we build a machine. We build a machine and we train a machine — well, not exactly one machine, many, many machines — to try to understand and translate the genome in a phenotype. What are those letters, and what do they do? It's an approach that can be used for everything, but using it in genomics is particularly complicated. Little by little we grew and we wanted to build different challenges. We started from the beginning, from common traits. Common traits are comfortable because they are common, everyone has them.

Frequently Occurring Word Combinations

ngrams of length 2

collocation         frequency
biggest amount      3
craig venter        3
thumb drives        2
machine learning    2
common traits       2
percent accuracy    2
human face          2

Important Words

  1. amazing
  2. approach
  3. beginning
  4. biggest
  5. build
  6. called
  7. challenges
  8. collected
  9. comfortable
  10. common
  11. complicated
  12. concept
  13. data
  14. database
  15. genome
  16. genomes
  17. genomics
  18. grew
  19. human
  20. learning
  21. letters
  22. machine
  23. machines
  24. middle
  25. nmr
  26. people
  27. phenotype
  28. phenotypes
  29. pleasure
  30. scan
  31. scientists
  32. secret
  33. side
  34. sides
  35. simple
  36. started
  37. team
  38. technology
  39. thousands
  40. train
  41. traits
  42. translate
  43. translation
  44. understand
  45. wanted
  46. work