News Release

HYPOD-X: Foundation database released to advance data-driven research in the field of quasicrystals

Peer-Reviewed Publication

Research Organization of Information and Systems

Figure 1: Three datasets comprising HYPOD-X and their data collection procedures

Credit: The Institute of Statistical Mathematics

Background
Quasicrystals are materials whose ordered but non-periodic atomic arrangement distinguishes them from conventional crystals. Approximant crystals, often regarded as precursor materials closely related to quasicrystals, share similar compositional and structural features but retain periodic atomic arrangements. These materials exhibit distinct physical properties, such as temperature dependences of electrical and thermal conductivity that differ from those of conventional metals. However, the lack of a comprehensive database has long been a significant barrier to machine-learning-driven quasicrystal research. An open database is also needed to deepen our understanding of the relationship between quasicrystal structures and their properties, and to stimulate the development of new materials.

Research Content and Results
The research group has developed the world’s first open database for quasicrystals and their approximants, called “HYPOD-X” (Hypermaterials Open Database for X, where X represents a wildcard for application targets, such as machine learning). HYPOD-X provides structured data on the composition, structure, and physical properties of quasicrystals and approximant crystals, extracted from texts and figures in scientific papers and books, in a format accessible to researchers and engineers. The database serves as a foundation for data-driven research on quasicrystals.
As shown in Figure 1, HYPOD-X comprises three datasets: the composition dataset, the phase diagram dataset, and the property dataset. The data, which have been manually or semi-automatically extracted, undergo rigorous expert review before being added to the database.
The composition dataset serves as a foundational source of information on quasicrystals and approximants. The data, including compositions, structural types, and heat treatment conditions, were manually collected and entered into the database after rigorous validation by experts. Automated algorithms for detecting erroneous data have also helped to improve data quality. The data volume is approximately ten times greater than that of a previous study [1] that compiled the compositions of quasicrystals. Using this dataset, the research group successfully discovered new quasicrystals with a machine learning algorithm called TSAI 1.0 [2].
The property dataset includes temperature-dependent data for thermal conductivity, electrical properties, and magnetic properties, extracted from figures and tables in scientific papers and books. Analyzing these data could reveal patterns that even experts in quasicrystals have overlooked. For instance, quasicrystals tend to exhibit an increase in thermal conductivity at higher temperatures, which is not typically observed in conventional metals or crystals. This property could be used to develop thermal rectifying materials that direct heat flow in specific directions. Identifying quasicrystals with favorable temperature dependences from this dataset may accelerate the development of new thermal management devices.
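As a rough illustration of the kind of screening described above, the following Python sketch fits a straight line to thermal conductivity versus temperature for each sample and lists the samples whose conductivity rises with temperature. It is a minimal sketch under stated assumptions: the record layout, the sample labels, and all numerical values are hypothetical and are not taken from HYPOD-X, and a linear fit is only one simple way to summarize a temperature dependence.

```python
# Hypothetical records: (sample label, temperature in K, thermal conductivity in W/(m K)).
# The layout and the values are invented for illustration; they are NOT HYPOD-X data.
RECORDS = [
    ("sample_A", 100.0, 1.0),
    ("sample_A", 200.0, 1.5),
    ("sample_A", 300.0, 2.0),
    ("sample_B", 100.0, 9.0),
    ("sample_B", 200.0, 7.0),
    ("sample_B", 300.0, 5.0),
]


def fit_slope(xs, ys):
    """Return the ordinary least-squares slope of ys against xs."""
    n = len(xs)
    if n < 2:
        raise ValueError("need at least two points to fit a slope")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all temperatures are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def conductivity_slopes(records):
    """Group records by sample and fit conductivity against temperature."""
    grouped = {}
    for label, temperature, conductivity in records:
        temps, kappas = grouped.setdefault(label, ([], []))
        temps.append(temperature)
        kappas.append(conductivity)
    return {
        label: fit_slope(temps, kappas)
        for label, (temps, kappas) in grouped.items()
    }


def rising_with_temperature(records):
    """Return the labels of samples whose fitted slope is positive, sorted by name."""
    slopes = conductivity_slopes(records)
    return sorted(label for label, slope in slopes.items() if slope > 0)


if __name__ == "__main__":
    print(rising_with_temperature(RECORDS))
```

With the invented values above, only the hypothetical sample_A is flagged, because its conductivity increases across the three temperatures while that of sample_B decreases. Real curves are often non-linear, so a screening of actual data would likely need a fit restricted to a chosen temperature range or a different summary measure.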
The phase diagram dataset contains digitized data extracted from figures in the extensive literature published to date. Specifically, it stores data quantifying the boundary compositions of each phase region, providing the compositional ranges and other conditions under which quasicrystals and approximant crystals are thermodynamically stable. Applying machine learning to this dataset enables the prediction of new quasicrystal and approximant crystal phases [2].

Future prospects
HYPOD-X offers a valuable new resource to advance quasicrystal research. The research group plans to continually expand the database. While data-driven research is becoming popular across various fields of materials science, the limited availability of data has hindered progress in data-driven quasicrystal research. With the launch of HYPOD-X, a diverse array of data-driven studies is expected to emerge. Furthermore, a comprehensive view of such extensive data is expected to yield new insights and scientific principles in quasicrystal science.

Published paper
Title: Comprehensive experimental datasets of quasicrystals and their approximants
Authors: Erina Fujita (1, 2), Chang Liu (1), Asuka Ishikawa (3), Tomoya Mato (2), Koichi Kitahara (4), Ryuji Tamura (3), Kaoru Kimura (1, 2), Ryo Yoshida (1, 2, 5), Yukari Katsura (2, 6, 7)
Journal: Scientific Data
DOI: 10.1038/s41597-024-04043-z
Published date: 2024/11/13

1. The Institute of Statistical Mathematics, 2. National Institute for Materials Science, 3. Tokyo University of Science, 4. National Defense Academy, 5. SOKENDAI, 6. Tsukuba University, 7. RIKEN

References
[1] W. Steurer and S. Deloudi, Crystallography of Quasicrystals, Springer Series in Materials Science 126 (Springer, Berlin, Heidelberg, 2009).
[2] C. Liu, K. Kitahara, A. Ishikawa, T. Hiroto, A. Singh, E. Fujita, Y. Katsura, Y. Inada, R. Tamura, K. Kimura, and R. Yoshida, Phys. Rev. Materials 7, 093805 (2023).

Acknowledgements
This work was supported by a MEXT KAKENHI Grant-in-Aid for Scientific Research in Innovative Areas (19H05817, 19H05818, 19H05820) and JST CREST (JPMJCR22O3).

###

About The Institute of Statistical Mathematics (ISM)
The Institute of Statistical Mathematics (ISM) is part of Japan's Research Organization of Information and Systems (ROIS). With more than 75 years of history, the institute is an internationally renowned facility for research on statistical mathematics, including comprehensive evaluation of earthquake data in Japan and other parts of the world. ISM comprises three departments (the Department of Statistical Modeling, the Department of Statistical Data, and the Department of Statistical Inference and Mathematics) as well as several key data and research centers. Through the efforts of its research departments and centers, ISM aims to continuously facilitate cutting-edge research collaboration with universities, research institutions, and industries both in Japan and in other countries.

About the Research Organization of Information and Systems (ROIS)
ROIS is the parent organization of four national institutes (National Institute of Polar Research, National Institute of Informatics, the Institute of Statistical Mathematics and National Institute of Genetics) and the Joint Support-Center for Data Science Research. ROIS's mission is to promote integrated, cutting-edge research that goes beyond the barriers of these institutions, in addition to facilitating their research activities as inter-university research institutes.

