Advances in Computational Modeling and Machine Learning of Cellulosic Biopolymers: A Comprehensive Review.
Sharmi Mazumder, Mohammad Hossein Golbabaei, Ning Zhang
Abstract
Open AccessThe hierarchical structure and multifunctional properties of bio-based cellular materials, particularly cellulose, hemicellulose, and lignin, have attracted increasing attention and interest due to their sustainability and versatility. Recent advances in computational modeling and machine learning strategies have provided transformative insights into the molecular, mechanical, thermal, and electronic behaviors of these biopolymers. This review categorizes the conducted studies based on key material properties and discusses the computational methods utilized, including quantum mechanical approaches, atomistic and coarse-grained molecular dynamics, finite element modeling, and machine learning techniques. For each property, such as structural, mechanical, thermal, and electronic, we have analyzed the progress made in understanding inter- and intra-molecular interactions, deformation mechanisms, phase behavior, and functional performance. For instance, atomistic simulations have shown that cellulose nanocrystals exhibit a highly anisotropic elastic response, with axial elastic moduli ranging from approximately 100 to 200 GPa. Similarly, thermal transport studies have shown that the thermal conductivity along the chain axis (≈5.7 W m-1 K-1) is nearly an order of magnitude higher than that in the transverse direction (≈0.7 W m-1 K-1). In recent years, this research area has also experienced rapid advancement in data-driven methodologies, with the number of machine learning applications for biopolymer systems increasing more than fourfold over the past five years. By bridging multiscale modeling and data-driven approaches, this review aims to illustrate how these techniques can be integrated into a unified framework to accelerate the design and discovery of high-performance bioinspired materials. Eventually, we have discussed emerging opportunities in multiscale modeling and data-driven discovery to outline future directions for the design and application of high-performance bioinspired materials. This review aims to bridge the gap between molecular-level understanding and macroscopic functionality, thereby supporting the rational design of next-generation sustainable materials.