Earlier this year, an algorithm was released online โ€” full and completely free โ€” for academics to use. This algorithm represents one of the most important scientific discoveries of our lifetimes, showcasing the growing potential for AI to alter the course of our civilization. Using deep learning and neural networks the algorithm known as Alpha Fold promises to revolutionize the field of biochemistry. It can help us better understand diseases, formulate medicine, and produce solutions to everything from plastic pollution to excess carbon in the atmosphere. By attempting to solve the protein-folding problem weโ€™ve made an exciting and unprecedented breakthrough that will affect the lives of us all.

3D rendering of a chain of amino acids, or what we call a protein.

The protein-folding problem has been an ongoing obstacle for the past 50 years. It first arose in 1972: a new theory proposed that knowing a proteinโ€™s amino acid sequence should allow you to fully predict its structure.

Proteins arenโ€™t just fundamental to life, they are responsible for almost all the processes that take place inside a cell. All living organisms rely on these complex molecules. In turn, a protein is made of a chain of 20 different amino acids. The interactions between these amino acids determine how the protein will fold into its 3D shape. The shape of a protein plays a large role in determining its function, hence why in biology there is the saying, โ€œstructure is functionโ€. Structure will determine what a protein will do and how it will work. Thus, we might be tempted to conclude that so long as one knows the sequence of a proteinโ€™s amino acids, one can determine its final 3D shape. But therein lies the problem.

Peptides are short chains of 2โ€“50 amino acids.

A single protein can be made of up to 2,000 amino acids. Determining all of their possible structures can take longer than the age of the entire universe. This amounts to some 10ยณโฐโฐ possibilities, meaning that a system capable of predicting how a protein folds will have to use something far more elegant and precise than simple brute force.

Ever since the competition known as the Critical Assessment of protein Structure Prediction (CASP) began in 1994, no competing team has come even close to making accurate predictions. The competition itself consists of hundreds of teams whose algorithms aim to predict the structure of about 100 different proteins from given sequences of amino acids. The structures of the proteins have already been determined experimentally but have not yet been publicly revealed. Predictions made by the teamsโ€™ algorithms are then compared to the experimental results and assessed by a panel of judges. Experimental methods used to determine protein structures include X-ray crystallography and cryo-electron microscopy (cryo-EM): well known but expensive methods of research.

Last year, DeepMindโ€™s Alpha Fold algorithm became the first to make increasingly accurate predictions. The algorithm made strides so astonishing that, to many researchers, the protein-folding problem has essentially been solved.

Alpha Foldโ€™s predictions were on average over 90% accurate in 2020. This is a huge improvement over the average of 40% accuracy achieved by the top performing CASP teams over the last couple of decades. When other teams would score a 75% accuracy rating on moderately difficult proteins, Alpha Fold would score 90%. Even when the algorithm did disagree with experimental results, it wasnโ€™t clear which one was more correct since both allow for a certain margin of error. For many of the Alpha Fold predictions this margin of error was the size of an atom, with the algorithm predicting the exact location of thousands of other atoms in the structure. Overall about two-thirds of the predictions were of the same quality as experimental results.

Accuracy rating (GDT) for the top performing teams at CASP since 2006. Image by DeepMind.

Figuring out a protein structure can take days or even years using experimental methods like nuclear magnetic resonance, X-ray crystallography, and cryo-EM. They are also more labor-intensive and costly, relying on trial and error and expensive machinery. Yet Alpha Fold isnโ€™t meant to replace them. Instead itโ€™s meant to supplement the work of researchers. Already Alpha Fold has helped scientists find structures of proteins that theyโ€™ve been researching for decades, enabling science to move forward where before it had been stalled. The Centre for Enzyme Innovation is using the algorithm to find an enzyme thatโ€™ll help us breakdown single use plastics. Itโ€™s also inspired other teams โ€” such as one from the University of Washington โ€” to improve upon Alpha Fold in order to make it faster and more energy efficient.

Like the computer programs of the 80โ€™s and 90โ€™s, however, the first iteration of Alpha Fold wasnโ€™t very successful. Its accuracy rating for CASP in 2018 was less than 60%. It wasnโ€™t until the second iteration of Alpha Fold that real progress was made using the help of deep learning. Deep learning is a kind of machine learning that imitates the way a human brain might behave, enabling the machine to learn with far less input from human beings than might be required by traditional machine learning.

Neural networks made of nodes make up the backbone of deep learning. There are at least 3 layers of nodes in a neural network: input, output, and hidden layers in between. Data is shared across nodes in the neural network and the machine then makes predictions which it can check against a dataset. Training data helps the machine improve its predictions. In the case of Alpha Foldโ€™s deep learning network, the training data consisted of folded proteins from the Protein Data Bank. Additionally, instead of only having 1 neural network Alpha Fold has 2 networks that work with one another to fold the protein, render a 3D model, and adjust their alignments of amino acids at the end.

Deep learning is a subset of artificial intelligence wherein neural networks help the algorithm process information. Here we see an input layer for the neural network, an output layer, and several hidden layers in between.

Having made Alpha Foldโ€™s code available for academics to use, the implications could be enormous.

Alpha Fold is working with Drugs for Neglected Diseases initiative (DNDi) and has so far enabled new treatments for illnesses, helping to replace toxic drugs that before could kill 1 in 20 patients with much safer medicine. Alpha Fold has also helped detect and prevent sight-threatening diseases of the eye and is helping in the research of antibiotic resistance. New designs could lead to proteins that breakdown toxic waste, or solutions to the problem of carbon capture. The industries it could revolutionize include medicine, agriculture, bioengineering, biotechnology, and food science, though itโ€™s unclear just how many breakthroughs will actually stem from the release of the Alpha Fold algorithm. It may well take a couple of decades for it to unravel its full potential, changing the world in many unexpected ways.

In the end Alpha Foldโ€™s solution to the protein-folding problem was a case of using machines to understand machines โ€” proteins, after all, are nothing more than microscopic machines programmed to transport oxygen, digest food, and everything in between. Itโ€™s a stunning showcase of the power of AI. Artificial Intelligence is this generationโ€™s telescope: an instrument to understand the mysterious phenomena all around us, opening up a new vision of the world.

Contributor

Recently Published

Key Takeaway: A study has found that humble leaders can become more promotable by growing others through a “humility route”. Human capital theory suggests that employees’ value can be enhanced by investing in their knowledge, skills, and abilities. Humble leaders focus on the learning and growth of their followers, creating human capital value for themselves. […]

Top Picks

Key Takeaway: The current economic climate is particularly concerning for young people, who are often financially worse off than their parents. To overcome this, it is important to understand one’s financial attachment style, which can be secure, anxious, or avoidant. Attachment theory, influenced by childhood experiences and education, can help shape one’s relationship with money. […]
Key Takeaway: Wellness culture, which claims to provide happiness and meaning, has been criticized for its superficial focus on superficial aspects like candles and juice cleanses. Psychological research suggests that long-term wellbeing comes from a committed pursuit of both pleasure and meaning. Martin Seligman’s Perma model, which breaks wellbeing into five pillars: positive emotions, engagement, […]
Key Takeaway: Quantum computing, which uses entanglement to represent information, has the potential to revolutionize everyday life. However, the development of quantum computers has been slow due to the need to demonstrate an advantage over classical computers. Only a few notable quantum algorithms have been developed, such as the BB84 protocol and Shor’s algorithm, which […]
Key Takeaway: China’s leaders have declared a GDP growth target of 5% in 2024, despite facing economic problems and a property crisis. The country’s rapid economic growth has been attributed to market incentives, cheap labor, infrastructure investment, exports, and foreign direct investment. However, none of these drivers are working effectively. The government’s determination to deflate […]
Key Takeaway: Neuralink, founded by Elon Musk, aims to implant a brain-computer interface (BCI) in people’s brains, allowing them to control computers or phones by thought alone. This technology holds the promise of alleviating human suffering and allowing people with disabilities to regain lost capacities. However, the long-term aspirations of Neuralink include the ability to […]

Trending

I highly recommend reading the McKinsey Global Instituteโ€™s new report, โ€œReskilling China: Transforming The Worldโ€™s Largest Workforce Into Lifelong Learnersโ€, which focuses on the countryโ€™s biggest employment challenge, re-training its workforce and the adoption of practices such as lifelong learning to address the growing digital transformation of its productive fabric. How to transform the country […]

Join our Newsletter

Get our monthly recap with the latest news, articles and resources.

Login

Welcome to Empirics

We are glad you have decided to join our mission of gathering the collective knowledge of Asia!
Join Empirics