Jeff Dean (Google Senior Fellow) – Deep Learning to Solve Challending Problems | ETH Zurich (Sep 2018)


Chapters

00:00:15 Deep Learning: Solving Challenging Problems and Grand Engineering Challenges
00:07:25 Machine Learning's Role in Solving Global Issues
00:10:09 Machine Learning in Healthcare and Robotics
00:20:40 Scientific Discovery Acceleration Through Machine Learning
00:24:10 Automating Machine Learning with TensorFlow
00:30:56 Advances in AutoML and Neural Architecture Search
00:34:02 Accelerating Machine Learning with TPUs
00:41:14 Machine Learning Techniques for System Optimization
00:44:05 Machine Learning Fundamentals and Societal Applications

Abstract

Exploring the Frontiers of Machine Learning: From Deep Learning to Real-World Applications

Introduction: Deep Learning’s Ascendance in Modern Technology

Deep learning, a modern rebranding of artificial neural networks, is experiencing a surge in interest and growth. Google’s research approach includes basic research, infrastructure building, product collaboration, and emerging area exploration. This synergy between foundational research and real-world application is a recurring theme in the evolution of machine learning.

The Exponential Growth of Machine Learning

Machine learning, particularly its subset deep learning, is undergoing an explosive growth, outpacing even Moore’s Law. This is evident in the surging number of research papers and advancements. Deep learning thrives on its ability to solve complex tasks through a collection of trainable mathematical units, showcasing versatility and adaptability.

Revolutionizing Fields with Neural Networks

Neural networks have made remarkable strides in various domains such as computer vision, audio transcription, and language translation. A notable achievement is the drastic reduction of error rates in computer vision, evidenced by the ImageNet contest results, plummeting from 26% in 2011 to just 3% in 2016.

Addressing Grand Challenges with Machine Learning

Deep learning’s potential to solve the U.S. National Academy of Engineering’s grand challenges is significant. Machine learning can significantly contribute to solving global challenges, such as improving urban infrastructure, promoting healthier living, and mitigating climate change. From improving urban infrastructure to revolutionizing self-driving cars, machine learning is at the forefront. Robotics, benefitting from advancements in vision technology, is moving towards more autonomous and adaptable systems, with robotic grasping systems now achieving up to 96% success rates. Self-driving cars are being actively tested and developed, with Waymo leading the way in real-world trials. These cars use sensor data to understand their surroundings, predict object movements, and make safe driving decisions.

Predicting Patient Mortality: Deep Learning in Healthcare

Traditional methods use a limited set of hand-designed features to predict mortality risk, while deep learning approaches utilize all data in the medical record, improving prediction accuracy. This enables earlier assessment of patients’ medical risk. Deep learning can predict patient mortality more accurately using all data in the medical record, allowing for earlier assessment of patients’ medical risk.

Accelerating Quantum Chemistry Simulations

Quantum chemists use computationally intensive simulations to predict molecule properties. Deep learning offers a faster alternative. Training a neural network with simulation data can achieve indistinguishable accuracy, while being 300,000 times faster than the original simulator.

Exploiting Simulators for Scientific Discovery

Replacing simulation components with neural networks can lead to significant speedups, making it feasible to screen large datasets and accelerating scientific discovery.

Healthcare: A Prime Beneficiary of Machine Learning

In healthcare, machine learning, especially in medical imaging diagnosis, stands on the brink of setting new care standards. Models trained on specialist-labeled images are matching or even surpassing human experts, offering insights into patterns previously unnoticed by ophthalmologists. Moreover, the application extends to electronic medical records, enhancing patient outcome predictions and treatment recommendations. Machine learning has tremendous potential in healthcare for improved decision-making. In medical imaging, computer vision techniques aid in diagnostic tasks like diabetic retinopathy screening. An ML model achieved par or better performance compared to US board-certified ophthalmologists in grading retinal images. Advanced models can identify patterns in retinal images that ophthalmologists may not be aware of, providing new insights into patient health. Machine learning models can predict future outcomes for patients based on electronic medical record data, aiding in decision-making and treatment planning. Applications of these ML models include Smart Reply in Gmail, machine translation, and predicting various aspects of patient care using de-identified medical records.

Expanding Machine Learning’s Reach with TensorFlow and AutoML

TensorFlow, Google’s open-source machine learning platform, has catalyzed the democratization of machine learning research and application. Its flexibility has led to diverse applications, from agricultural innovations to healthcare diagnostics. TensorFlow is an open-source machine learning framework that enables users to express machine learning research ideas and deploy them in real-world environments. With over 16 million downloads and a vibrant community, it runs on various platforms, including mobile phones, desktops, and data centers. Complementing this is the advent of AutoML, aiming to make machine learning accessible to non-experts through automated model design and neural architecture search, generating models with unique structures and efficient computational use.

Enhancing Computational Efficiency with TPUs

Google’s development of Tensor Processing Units (TPUs) marks a significant advancement in machine learning hardware. From the inference-focused TPUv1 to the more comprehensive TPUv2 and the enhanced TPUv3 with water cooling for increased performance, these specialized accelerators have revolutionized machine learning research by enabling faster and more efficient training and experimentation.

Machine Learning for System Optimization

Machine learning is not just transforming applications but also systems themselves. Learned indices, utilizing neural networks for key position prediction, are proving more efficient than traditional B-trees. This concept extends to instruction scheduling, register allocation, and various system optimization domains, where learning-based approaches can significantly outperform traditional heuristics.

The Ethical Dimension of AI in Society

As AI permeates various societal aspects, ensuring its interpretability, safety, and fairness becomes paramount. AI systems, especially in critical domains like medical diagnostics, must be transparent and explainable. Moreover, addressing data bias to prevent unfairness in machine learning predictions is an ongoing research focus.

The Future of Machine Learning

In conclusion, machine learning, particularly through deep learning, is making significant strides across various disciplines. The development of specialized hardware like TPUs is enhancing computational efficiency, while applications in fields from healthcare to system optimization showcase the technology’s breadth. However, the responsible and ethical use of AI is crucial to address challenges around interpretability, safety, bias, and privacy, ensuring AI’s beneficial integration into society.

Supplemental Updates from Recent Research and Developments:

TPU Development Journey:

Google’s TPU development has been instrumental in advancing machine learning hardware. TPUv1, designed for inference, enabled low-latency needs. TPUv2, capable of both training and inference, offered increased compute power and scalability. The latest TPUv3, with water cooling and enhanced performance, enables large-scale machine learning problems and quick experimentation results.

Machine Learning for Systems: Learned Index Data Structures:

Learned index data structures, utilizing neural networks for key position prediction, offer significant improvements over traditional B-trees. They can be faster, smaller, and adapt to the system’s usage patterns, leading to enhanced performance and efficiency.

Reflections on the Wide-Ranging Applications and Societal Implications of Machine Learning:

Machine learning’s potential for learning-based systems in various domains, from instruction scheduling to ASIC design, holds promise for high-performance and self-adaptive systems. However, concerns about AI’s safety, bias, interpretability, and privacy require ongoing research and the development of methodologies to ensure the responsible use of learned systems.


Notes by: oganesson