Demis Hassabis (DeepMind Co-founder) – Using AI to accelerate scientific discovery | MIT (Apr 2022)
Chapters
00:00:00 DeepMind and the Advancement of Scientific Discovery with AI
Demis’s Return to MIT: Demis Hassabis, co-founder of DeepMind, has maintained a close relationship with MIT since his time as a visiting postdoc in 2009. His advisory role in MIT’s Center for Brains, Minds, and Machines (CBMM) has been instrumental in shaping the center’s research directions. Hassabis’s regular visits to MIT every three years provide an opportunity for him to share DeepMind’s progress and engage with the academic community.
AlphaFold’s Impact on Biology: Hassabis emphasizes the significance of AlphaFold, a DeepMind project that predicts protein structures from amino acid sequences. He invites biologists in the audience to share their feedback on AlphaFold and explores potential collaborations.
Recapitulating Past Work Leading to AI in Science: Hassabis highlights DeepMind’s journey over the past 10-12 years, leading to the current stage where AI is meaningfully applied to scientific discovery. He aims to connect the dots between DeepMind’s earlier work on games and its current focus on using AI for scientific advancements.
DeepMind’s Founding and Mission: DeepMind was established in 2010 with an ambitious vision to develop artificial general intelligence (AGI) or human-level AI. The company’s mission is to solve intelligence and use it to advance science and benefit humanity.
Solving Intelligence: Hassabis defines solving intelligence as fundamentally understanding the phenomenon and recreating it artificially to achieve AGI. He believes that a general approach to solving intelligence can lead to algorithms applicable to a wide range of challenging tasks, mirroring the capabilities of the human brain.
Learning Systems Approach: DeepMind’s approach to AI involves learning systems, particularly deep reinforcement learning (RL). RL systems observe data streams, build models of their environment, and learn to achieve specific goals or maximize rewards within those environments. These systems can potentially discover new knowledge from first principles through trial and error.
AlphaGo’s Success and Significance: AlphaGo, DeepMind’s program for playing the game of Go, was a landmark achievement in deep RL. Hassabis highlights the challenge of Go for computers due to its vast search space and esoteric nature.
00:11:51 The AlphaGo System: A Revolutionary Approach to Two-Player Games
Background: Go is a challenging game due to its complexity and reliance on intuition and pattern matching. AlphaGo and its successors use a self-play training approach to improve their evaluation and move prediction skills.
Training Process: Start with a random neural network. Train the network through hundreds of thousands or millions of self-play games. Use the generated data to train a new version of the network (V2). Conduct a mini-tournament between V2 and V1. If V2 wins by a certain threshold (e.g., 55%), replace V1 with V2. Repeat the process until V2 consistently beats V1.
Results: AlphaGo achieved superhuman performance in various two-player games, including Go, within 17 to 20 iterations.
Generalization to Other Games: AlphaGo’s underlying approach can be applied to any two-player perfect information game.
Searching a Constrained Space: AlphaGo’s neural network learns a model of the environment, constraining the search space for possible moves.
Original Go Strategies: AlphaGo demonstrated creativity by introducing original strategies, such as move 37 in game two. This move, initially considered unconventional, later became a decisive factor in winning the game.
AI Systems and Interpolation: AI systems excel at interpolation tasks, such as distinguishing between images of cats and dogs.
AI Creativity: AI systems can exhibit creativity in Go by developing original strategies that are not simply interpolations of known strategies. However, AI systems currently lack the ability for true invention, such as inventing new games like Go or Chess.
Generalization: DeepMind’s approach to engineering involves focusing on performance first and then generalizing the system to make it less specific to a particular task. AlphaZero achieved remarkable results by beating the best programs in various games, including AlphaGo, within a few hours of training from nothing. Generalization can also provide performance benefits, as seen with AlphaZero’s improved strength compared to AlphaGo.
Analytics and Understanding: AlphaZero’s success prompted efforts to analyze and understand the system’s inner workings, aiming to open the “black box” and gain insights into its knowledge representation and limitations.
Chess and Stockfish: AlphaZero’s capabilities were also demonstrated in chess, where it surpassed Stockfish 8, the reigning world champion chess computer.
00:22:27 AI Applications in Game Theory and Scientific Discovery
AlphaZero’s Unique Style of Chess: AlphaZero invented a new style of chess, favoring mobility over materiality, and its games were considered beautiful and aesthetically pleasing. AlphaZero sacrificed pieces and pawns to imprison black pieces, leading to the “immortal Zagzwang game,” where no move could improve the black position.
Advantages of AlphaZero over Traditional Chess Engines: AlphaZero doesn’t have inbuilt rules and valuation tables like traditional chess engines. It is freer to evaluate positions based on contextual factors and make sacrifices when advantageous. AlphaZero balances different factors more effectively and can dynamically adjust its evaluation based on the position. It requires less search for strong play, making it more efficient than traditional chess engines.
AlphaZero’s Influence on Human Chess Players: Magnus Carlsen and Garry Kasparov praised AlphaZero’s style and approach to chess. Carlsen acknowledged AlphaZero’s influence on his own play. Kasparov noted that AlphaZero’s learning process allows it to reflect the “truth” of chess.
DeepMind’s Approach to Scientific Discovery: DeepMind seeks out massive combinatorial search spaces where exhaustive brute force approaches are not tractable. It requires a clear objective function or measure to optimize against. Sufficient data or an accurate and efficient simulator is necessary for learning and progress. DeepMind has successfully used synthetic data generated from simulators to augment real data.
DeepMind Technologies in Commercial Products: DeepMind technology is used in various Google products, including YouTube recommendations, Android battery life optimization, and text-to-speech systems.
Protein Folding and Its Importance: Protein folding is the process of determining a protein’s 3D structure from its amino acid sequence. Proteins are essential to life and their 3D structure determines their function and drug targets. DeepMind’s AlphaFold tackles the protein folding problem and has achieved significant breakthroughs.
Understanding the Protein Folding Problem: Determining protein structure experimentally is a time-consuming and laborious process. Christian Anfinsen’s Nobel lecture sparked a 50-year quest to predict protein structure directly from its amino acid sequence.
Leventhal’s Paradox: Leventhal’s paradox highlights the computational challenges of protein structure prediction due to the vast number of possible shapes a protein can take. Despite this, proteins spontaneously fold efficiently in nature, indicating the existence of an efficient solution.
Personal Journey with Protein Folding: Demis Hassabis’s early fascination with the protein folding problem as a student in Cambridge. Belief in AI’s potential to solve the problem in the future, despite limitations of AI techniques in the 1990s.
00:36:27 Evolution of Protein Folding Prediction Techniques and the Impact of AlphaFold
Background: Demis Hassabis’ first encounter with the protein folding problem was through the Foldit game, a citizen science game that allowed gamers to contribute to science by solving protein folding puzzles. Hassabis was intrigued by the idea of using gamers’ intuition and pattern matching skills to find protein folds, and he noticed that some players were able to find real structures that were published in scientific journals.
Mimicking Intuition: After AlphaGo’s success in mimicking the intuition of Go masters, Hassabis realized that it might be possible to apply similar techniques to protein folding. Protein folding is a highly complex problem, and greedy energy minimization methods often get stuck in local maxima, resulting in incorrect folds. Hassabis believed that AlphaFold could use its intuition, gained through training on large datasets, to find better solutions than traditional methods.
CASP Competition: CASP is a biennial competition that assesses the accuracy of protein structure prediction methods. Experimental departments produce new protein structures and submit them to CASP organizers, who withhold the structures from the competing teams. Teams then have one week to predict the structures using their methods, and the organizers compare the predicted structures to the experimentally determined ground truth. CASP has been running since 1994 and is considered the Olympics of protein folding.
AlphaFold’s Breakthrough: AlphaFold was the first method to use cutting-edge machine learning as the core component of its solution to the protein folding problem. In CASP 2018, AlphaFold won the competition by a significant margin, achieving an accuracy level that was almost 50% higher than the next best team. This breakthrough revolutionized the field of protein folding and demonstrated the power of machine learning in addressing complex scientific problems.
Stagnant Progress: Prior to AlphaFold’s entry in CASP 2018, there had been minimal progress in protein structure prediction for over a decade. Traditional methods had reached a plateau, and the field was in need of new approaches. AlphaFold’s success highlighted the potential of machine learning to accelerate scientific progress and address long-standing challenges.
00:43:57 AlphaFold 2: Protein Structure Prediction at Atomic Accuracy
Key Advances of AlphaFold2 over AlphaFold1: End-to-end system with iterative recycling stage. Direct prediction from amino acid sequence to 3D structure. Attention-based neural network for inferring implicit graph structure. Incorporation of evolutionary and physical constraints into the model architecture.
Accuracy and Performance: Atomic accuracy achieved at Cas14 competition. Less than 1 Angstrom error, surpassing experimental methods. Significantly more accurate than other systems.
Broad Access to the Biology Community: Folded entire human proteome within two weeks. Doubled the experimentally determined human proteome to 36% with sub-angstrom accuracy. Determined another 58% at high accuracy, useful for fundamental research.
Confidence Metric for Biologists: Developed a confidence metric to assess the reliability of predictions. Greater than 90 indicates experimental level accuracy. Greater than 70 indicates high accuracy regime. Lower than 50 may indicate disorder.
Folded 20 Model Organisms and Major Diseases: Expanded the scope beyond human proteome to include important research organisms and diseases. Aimed to cover more of the biology space.
00:55:06 AlphaFold Protein Structure Database and Its Impact on Biology
Database Creation: AlphaFold Protein Structure Database was created in collaboration with EMBL-EBI to store 3D structures and other tools for biologists.
Free and Open Access: The database was released for free with unrestricted access for any use, maximizing scientific impact and benefit to humanity.
Database Content: It initially contained 330,000 structures for 20 model organisms and the human proteome. Now expanded to over a million predictions, including 440,000 annotated ones from Swiss prot.
Focus on Neglected Tropical Diseases: Collaborated with the WHO to prioritize understudied neglected tropical diseases like Chagas disease and Leishmaniasis. This enables non-profits to initiate drug design efforts with the available protein structures.
Applications and Impact: AlphaFold has been used to model complex biological structures previously not possible, such as the nuclear pore complex. It has aided in predicting protein disorder and identifying protein structures for pathogens with pandemic potential. Over 350,000 researchers have utilized the database, reaching nearly all biologists worldwide.
Future Directions: Aiming to expand the database to include nearly 200 million known proteins, covering a vast range of organisms.
What’s Next for AlphaFold?: Researchers are working on applying AlphaFold to study protein complexes, disordered proteins, point mutations, and protein dynamics. They are also exploring the use of AlphaFold in protein design.
AlphaFold as a Proof of Concept for Digital Biology: Demis Hassabis believes that AlphaFold is a proof of concept for a new era of digital biology. He envisions creating a virtual cell that can be used to make useful predictions in silico.
AI as the Perfect Description Language for Biology: Hassabis believes that AI is the perfect description language for biology because it is an emergent process that is too complex to be described mathematically in a clean way.
AI as the Ultimate General Purpose Tool for Scientists: Hassabis believes that AI used properly could be the ultimate general purpose tool to help scientists see further, much like the Hubble telescope was for cosmologists.
Abstract
Revolutionizing Science and Discovery: The Impact of DeepMind’s AI Innovations
Unraveling the Complex World of Proteins with DeepMind’s AlphaFold
At a recent MIT presentation, AI expert Demis Hassabis unveiled DeepMind’s groundbreaking contributions to science, notably through AlphaFold, an AI system revolutionizing protein structure prediction. This technology is redefining our understanding of biology and accelerating drug discovery. Hassabis emphasizes the potential of AI to transform scientific inquiry, underscoring the need for its responsible development and deployment. Demis Hassabis’ close relationship with MIT, where he was a visiting postdoc in 2009, has been instrumental in shaping the research directions of the Center for Brains, Minds, and Machines (CBMM). His regular visits to MIT provide opportunities to share DeepMind’s progress and engage with the academic community.
Hassabis’ presentation focused on AlphaFold, a DeepMind project that predicts protein structures from amino acid sequences. He invited biologists in the audience to share their feedback on AlphaFold and explored potential collaborations.
The Core Mechanics of DeepMind’s AI: From AlphaGo to AlphaZero
DeepMind’s journey began with AlphaGo, an AI program that mastered Go using neural networks and self-play, progressively refining its strategies through iterative improvement. AlphaGo’s neural network was trained through hundreds of thousands or millions of self-play games. The generated data was used to train a new version of the network, V2. V1 and V2 were pitted against each other in mini-tournaments, and V2 replaced V1 if it won by a certain threshold. This process was repeated until V2 consistently beat V1.
Integral to its success was the Monte Carlo tree search, allowing efficient exploration of move possibilities. AlphaGo’s innovation didn’t just stop at learning existing strategies; it demonstrated creativity by developing novel tactics, as seen in its historic match against Lee Sedol. While AI systems excel in interpolation, they face challenges in extrapolation, a frontier yet to be conquered.
A New Era in Chess: AlphaZero’s Influence and Efficiency
Transitioning from specific to general applications, DeepMind introduced AlphaZero, extending the capabilities of AlphaGo to master chess and other games rapidly. This shift towards generalized AI systems highlights the balance between performance and adaptability, a cornerstone of DeepMind’s philosophy.
AlphaZero achieved remarkable results by beating the best programs in various games, including AlphaGo, within a few hours of training from nothing. Generalization can also provide performance benefits, as seen with AlphaZero’s improved strength compared to AlphaGo. AlphaZero’s impact on chess was nothing short of revolutionary. It introduced a novel playing style, emphasizing mobility over material gain and reshaping traditional chess strategies. Its efficient decision-making process, requiring fewer searches than conventional chess engines, marked a significant advancement in AI. The influence of AlphaZero extended beyond AI, inspiring chess champions like Magnus Carlsen to incorporate its strategies into their gameplay. AlphaZero invented a new style of chess, favoring mobility over materiality, and its games were considered beautiful and aesthetically pleasing. AlphaZero sacrificed pieces and pawns to imprison black pieces, leading to the “immortal Zagzwang game,” where no move could improve the black position.
From Games to Science: DeepMind’s Pioneering Exploration
Leveraging the success in gaming AI, DeepMind shifted its focus to scientific problems characterized by vast combinatorial spaces and clear objective functions. This transition was fueled by the availability of ample data and the potential for accurate simulators. Selecting problems with these criteria, DeepMind ventured into various domains, from Google’s product enhancements to tackling the longstanding protein folding challenge.
DeepMind’s approach to engineering involves focusing on performance first and then generalizing the system to make it less specific to a particular task. AlphaZero doesn’t have inbuilt rules and valuation tables like traditional chess engines. It is freer to evaluate positions based on contextual factors and make sacrifices when advantageous. AlphaZero balances different factors more effectively and can dynamically adjust its evaluation based on the position. It requires less search for strong play, making it more efficient than traditional chess engines.
AlphaFold: A Milestone in Understanding Proteins
AlphaFold emerged as a pivotal development in solving the protein folding problem, a crucial area in biology. This system’s ability to predict protein structures with exceptional precision marked a significant leap forward, addressing a challenge that had stumped scientists for decades. The foundation of AlphaFold’s success lies in its innovative architecture, incorporating attention-based neural networks and integrating evolutionary and physical constraints. This breakthrough enabled the prediction of the human proteome’s structures, leading to substantial advancements in biology and medicine.
Determining protein structure experimentally is a time-consuming and laborious process. Christian Anfinsen’s Nobel lecture sparked a 50-year quest to predict protein structure directly from its amino acid sequence. Leventhal’s paradox highlights the computational challenges of protein structure prediction due to the vast number of possible shapes a protein can take. Despite this, proteins spontaneously fold efficiently in nature, indicating the existence of an efficient solution.
AlphaFold’s Global Impact and Future Prospects
DeepMind’s collaboration with EMBL-EBI led to the creation of the AlphaFold Protein Structure Database, a resource now integral to researchers worldwide. The database, with over a million predictions, focuses on advancing drug design, especially for neglected diseases, and exploring complex biological models previously deemed impossible. Looking forward, DeepMind aims to predict structures for all known proteins, with explorations in diverse environments and new methods for protein structure determination.
AlphaFold’s impact on biology has been transformative, enabling researchers to solve long-standing problems and make breakthroughs in drug discovery and protein engineering. The database of predicted structures is a valuable resource for scientists worldwide, accelerating progress in various fields.
AlphaFold2: Achievements, Innovations, and Impact in Protein Structure Prediction
– AlphaFold2’s end-to-end system with iterative recycling stage and direct prediction from amino acid sequence to 3D structure facilitated accurate protein structure prediction.
– Its attention-based neural network inferred implicit graph structure, while evolutionary and physical constraints integrated into its architecture enhanced prediction accuracy.
– AlphaFold2 achieved atomic accuracy at the Cas14 competition, outperforming experimental methods with less than 1 Angstrom error.
– The broad access to the biology community was facilitated through the AlphaFold Protein Structure Database, which contained 330,000 structures and has now expanded to over a million predictions.
– Collaborations with EMBL-EBI and the WHO focused on neglected tropical diseases enabled drug design efforts, while applications in modeling complex biological structures and identifying protein structures for pathogens with pandemic potential showcased its versatility.
Rethinking Biology and the Role of AI
Hassabis views biology as an information processing system, suggesting that AI can play a pivotal role in decoding and manipulating these biological systems. The concept of digital biology, including the creation of virtual cells, is becoming increasingly plausible with advancements like AlphaFold. Moreover, DeepMind’s AI applications extend to quantum chemistry, mathematics, fusion, and genomics, positioning AI as a versatile tool in scientific discovery. Hassabis defines solving intelligence as fundamentally understanding the phenomenon and recreating it artificially to achieve AGI. He believes that a general approach to solving intelligence can lead to algorithms applicable to a wide range of challenging tasks, mirroring the capabilities of the human brain.
AlphaFold, Digital Biology, and the Future of AI in Science
– Researchers are delving into applying AlphaFold to study protein complexes, disordered proteins, point mutations, and protein dynamics, exploring its use in protein design.
– AlphaFold’s success is seen as a proof of concept for digital biology, with the vision of creating virtual cells for in silico predictions.
– Hassabis emphasizes the role of AI as the perfect description language for biology due to its complexity.
– AI’s potential as the ultimate general purpose tool for scientists is highlighted, akin to the Hubble telescope’s impact on cosmology.
AI as a Catalyst for Scientific Breakthroughs
DeepMind’s journey from mastering games to revolutionizing protein structure prediction exemplifies AI’s transformative power in science. As Hassabis envisions, AI stands as the ultimate general-purpose tool, enabling scientists to unravel complex problems and drive innovative discoveries. With responsible development and deployment, AI’s potential to reshape various scientific fields is immense, marking a new era in our quest for knowledge and understanding of the natural world.
AI has evolved from simple logic systems to complex machine learning models, showing potential in solving complex problems and achieving AGI. Balancing innovation and ethical considerations is crucial to ensure AI's positive impact on society....
Demis Hassabis' journey in AI spans from early fascination with chess and game programming to spearheading revolutionary achievements like AlphaFold and GATO, while also emphasizing the ethical development of AI and its potential to expand human knowledge and understanding. Hassabis envisions AI as a tool for humanity's advancement, scientific discovery,...
DeepMind, co-founded by Demis Hassabis, has achieved AI breakthroughs in games, protein folding, and more, while emphasizing ethical considerations and responsible AI development. DeepMind's journey showcases AI's potential for scientific discovery and societal impact....
DeepMind's AlphaFold revolutionized protein structure prediction, while its AI systems achieved breakthroughs in games like Go and chess, leading to ethical considerations and advancements in AI development. DeepMind's approach to AI emphasizes the scientific method, responsible deployment, and treating AI with respect and caution....
AlphaFold revolutionized protein structure prediction and accelerated drug discovery by enabling rapid and accurate determination of protein structures. The impact of AI extends beyond biology, with applications in fields like quantum chemistry, mathematics, and fusion....
Artificial intelligence, particularly machine learning, has revolutionized scientific discovery, with breakthroughs like AlphaGo and AlphaFold2 demonstrating its ability to master complex challenges. Demis Hassabis envisions a future where AI serves as a fundamental language for understanding life, akin to mathematics in physics....
Demis Hassabis' vision for AI involves developing self-learning systems that can adapt and handle unpredicted scenarios, which could potentially surpass human capabilities. AlphaZero, a self-learning system, has demonstrated unparalleled adaptability and efficiency, revolutionizing AI and human understanding in diverse domains....