Machine learning system sorts out materials databases

EPFL and MIT scientists have used machine learning to organize the chemical diversity found in the ever-growing databases for the popular metal-organic framework materials.

Metal-organic frameworks (MOFs) are a class of materials that contain nano-sized pores. These pores give MOFs record-breaking internal surface areas, which can measure up to 7,800 m² in a single gram of material. As a result, MOFs are extremely versatile and find multiple uses: separating petrochemicals and gases, mimicking DNA, producing hydrogen, and removing heavy metals, fluoride anions, and even gold from water are just a few examples.

Because of their popularity, materials scientists have been rapidly developing, synthesizing, studying, and cataloguing MOFs. More than 90,000 MOFs have been published to date, and the number grows every day. Though exciting, the sheer number of MOFs is creating a problem: “If we now propose to synthesize a new MOF, how can we know if it is truly a new structure and not some minor variation of a structure that has already been synthesized?” asks Professor Berend Smit at EPFL Valais-Wallis, which houses a major chemistry department.

To address the issue, Smit teamed up with Professor Heather J. Kulik at the Massachusetts Institute of Technology and used machine learning to develop a “language” for comparing two materials and quantifying the differences between them.

Armed with their new “language”, the researchers set off to explore the chemical diversity in MOF databases. “Before, the focus was on the number of structures,” says Smit. “But now, we discovered that the major databases have all kinds of bias towards particular structures. There is no point in carrying out expensive screening studies on similar structures. One is better off in carefully selecting a set of very diverse structures, which will give much better results with far fewer structures.”
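To make the idea of picking "a set of very diverse structures" concrete, here is a minimal sketch of one common approach, greedy farthest-point sampling. It is an illustration only, not the method reported in the study: it assumes each material has already been reduced to a numeric descriptor vector, and the function name `select_diverse` and the toy descriptors are hypothetical.

```python
import math


def select_diverse(descriptors, k, start=0):
    """Greedy farthest-point sampling over descriptor vectors.

    Returns the indices of k items, each chosen to be as far as possible
    from everything picked so far. The descriptors are assumed (not taken
    from the study) to be equal-length numeric tuples.
    """
    if not 0 < k <= len(descriptors):
        raise ValueError("k must be between 1 and the number of items")
    chosen = [start]
    # Distance from every item to its nearest already-chosen item.
    nearest = [math.dist(d, descriptors[start]) for d in descriptors]
    while len(chosen) < k:
        # Pick the item that is farthest from the current selection.
        nxt = max(range(len(descriptors)), key=lambda i: nearest[i])
        chosen.append(nxt)
        nearest = [
            min(n, math.dist(d, descriptors[nxt]))
            for n, d in zip(nearest, descriptors)
        ]
    return chosen


# Toy data: two tight clusters of near-duplicates plus one outlier.
toy = [(0, 0), (0.1, 0), (0, 0.1), (5, 5), (5.1, 5), (10, 0)]
picked = select_diverse(toy, 3)
print(picked)
```

On the toy data the selection takes one point from each region rather than three near-duplicates from the first cluster, which is the behaviour Smit describes as preferable to screening many similar structures.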

Another interesting application is “scientific archeology”: The researchers used their machine learning system to identify the MOF structures that, at the time of the study, were published as very different from the ones that are already known. “So we now have a very simple tool that can tell an experimental group how different their novel MOF is compared to the 90,000 other structures already reported,” says Smit.
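A tool of this kind can be pictured as a nearest-neighbour lookup: the further a new material sits from its closest known relative, the more novel it is. The sketch below is a simplified illustration under the assumption that every structure is represented by a numeric descriptor vector; the function `novelty` and the sample values are hypothetical and do not come from the study.

```python
import math


def novelty(candidate, known):
    """Return (distance, index) of the known descriptor closest to candidate.

    A small distance suggests a minor variation of an existing structure;
    a large one suggests something genuinely different. Euclidean distance
    is an assumption made for this sketch.
    """
    distances = [math.dist(candidate, k) for k in known]
    idx = min(range(len(known)), key=distances.__getitem__)
    return distances[idx], idx


# Hypothetical descriptors for two already-reported structures.
known = [(0.0, 0.0), (3.0, 4.0)]
close_score = novelty((0.0, 1.0), known)
far_score = novelty((6.0, 8.0), known)
print(close_score, far_score)
```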

The study is published in Nature Communications.
