Home Blog Newsfeed AI Predicts Protein Location Inside Human Cells, Revolutionizing Disease Diagnosis
AI Predicts Protein Location Inside Human Cells, Revolutionizing Disease Diagnosis

AI Predicts Protein Location Inside Human Cells, Revolutionizing Disease Diagnosis

In a groundbreaking advancement, researchers at MIT, Harvard University, and the Broad Institute have developed a novel AI-driven approach capable of predicting the precise location of virtually any protein within a human cell. This innovation holds immense potential for revolutionizing disease diagnosis, identifying drug targets, and deepening our understanding of complex biological processes.

The challenge lies in the sheer number of proteins—approximately 70,000—within a single human cell. Manually identifying the location of each protein is an extremely time-consuming and costly endeavor. Existing computational techniques, while helpful, often fall short in exploring the vast possibilities.

The newly developed method overcomes these limitations by accurately predicting protein location, even for proteins and cell lines that have never been tested before. Moreover, it operates at the single-cell level, providing detailed insights into protein localization within individual cells, which is a significant leap beyond averaged estimates.

This research combines a protein language model with a sophisticated computer vision model to capture intricate details about both the protein and the cell. The user inputs the amino acid sequence of the protein and three stained cell images (nucleus, microtubules, and endoplasmic reticulum), and the AI delivers an image highlighting the predicted protein location within the cell.

Yitong Tseo, a graduate student at MIT, highlights the potential of this tool to significantly reduce lab work: “You could do these protein-localization experiments on a computer without having to touch any lab bench, hopefully saving yourself months of effort. While you would still need to verify the prediction, this technique could act like an initial screening of what to test for experimentally.”

The researchers trained their model, named PUPS, using a two-part method. The first part analyzes the protein sequence to determine its localization properties and 3D structure. The second part uses an image inpainting model to gather information about the cell’s state from stained images.

During training, PUPS is assigned a secondary task to explicitly name the compartment of localization (e.g., cell nucleus). This helps the model improve its overall understanding of cell compartments and how proteins interact with them.

Xinyi Zhang, a graduate student at MIT, emphasizes the uniqueness of their approach: “Most other methods usually require you to have a stain of the protein first, so you’ve already seen it in your training data. Our approach is unique in that it can generalize across proteins and cell lines at the same time.”

The research team validated PUPS’s predictive capabilities through laboratory experiments, confirming its ability to accurately predict the subcellular location of new proteins in unseen cell lines.

Looking ahead, the researchers aim to enhance PUPS to understand protein-protein interactions and make localization predictions for multiple proteins within a cell. Ultimately, they envision applying PUPS to living human tissue, further expanding its potential impact on biological research and medical applications.

Add comment

Sign Up to receive the latest updates and news

Newsletter

© 2025 Proaitools. All rights reserved.