Skip to content

Commit 46ae23e

Browse files
committed
Align lecture site style and top links with DSfEM WS25
1 parent 634a5e2 commit 46ae23e

19 files changed

Lines changed: 101 additions & 8 deletions

.gitignore

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -8,3 +8,5 @@
88
/trackchanges.sty
99

1010
/.luarc.json
11+
12+
**/*.quarto_ipynb

_freeze/index/execute-results/docx.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
11
{
2-
"hash": "7ceb29e1a1f11e2aa58236ca30b45b85",
2+
"hash": "47531608e9a28490b70a10a8eba0b468",
33
"result": {
44
"engine": "jupyter",
5-
"markdown": "---\ntitle: Materials Genomics\nsubtitle: Computational Materials Discovery\nauthor:\n - name: Philipp Pelz\n corresponding: true\nkeywords:\n - Materials Science\n - Machine Learning\n - Computational Materials Discovery\n - Materials Databases\n - Crystal Structure\nabstract: |\n This course introduces students to materials genomics, treating the periodic table and all known crystal structures as a searchable, computable design space. Students learn how materials databases are built, how to represent matter as numbers, graphs, or fingerprints, how to interrogate and predict properties of solids, how to use ML as a surrogate for quantum mechanics, and how to design new materials algorithmically.\ndate: last-modified\nbibliography: references.bib\nnumber-sections: true\njupyter: python3\n---\n\n## Course Information\n\n**4th Semester – 5 ECTS · 2h lecture + 2h exercises per week, together with ML for Materials Processing & Characterization**\n \n## Course Philosophy\n\nMaterials genomics treats the periodic table and all known crystal structures as a giant searchable, computable design space.\n\nStudents learn:\n\n- how materials databases are built,\n- how to represent matter as numbers, graphs, or fingerprints,\n- how to interrogate and predict properties of solids,\n- how to use ML as a surrogate for quantum mechanics,\n- how to design new materials algorithmically.\n\n## Week-by-Week Curriculum (14 weeks)\n\n### Unit I — Foundations of Materials Genomics (Weeks 1–3)\n\n#### Week 1 – What is Materials Genomics?\n\n- Genomics analogy: genes → functions vs atoms → properties.\n- Brief history: AFLOW, OQMD, Materials Project, NOMAD.\n- PSPP from the structure-first viewpoint.\n\n**Exercise:** Explore Materials Project; query bandgaps, energies, symmetries.\n\n#### Week 2 – Crystal structure fundamentals\n\n- Space groups, Wyckoff positions, symmetry operations.\n- How symmetry informs descriptors.\n\n**Exercise:** Using pymatgen / spglib to analyze symmetries.\n\n#### Week 3 – Materials databases & file formats\n\n- CIF, POSCAR, PDB-like formats.\n- Thermodynamic quantities in databases: formation energy, stability, convex hull.\n\n**Exercise:** Parse CIF files, extract primitive cells, compute density.\n\n\n### Unit II — Representations of Materials (Weeks 4–6)\n\n#### Week 4 – Classical descriptors & materials fingerprints\n\n- Magpie, matminer.\n- Stoichiometric, elemental, and structural features.\n\n**Exercise:** Build a small property regressor with Magpie features.\n\n#### Week 5 – Graph-based representations\n\n- Crystal structures as graphs: nodes, edges, periodic boundary conditions.\n- CGCNN, MEGNet architecture intuition (no training from scratch yet).\n\n**Exercise:** Build a simple CGCNN-like graph featurizer.\n\n#### Week 6 – Local atomic environments\n\n- Voronoi tessellations, coordination numbers, SOAP descriptors.\n- Role in interatomic potentials and ML force fields.\n\n**Exercise:** Compute SOAP vectors; perform clustering in descriptor space.\n\n### Unit III — High-Throughput Computation & Screening (Weeks 7–9)\n\n#### Week 7 – Quantum mechanical data and DFT basics\n\n- What DFT gives you: energies, forces, band structures, elastic constants.\n- Why it's expensive; why ML surrogates matter.\n\n**Exercise:** Run a toy DFT calculation (Quantum Espresso or MP workflows).\n\n#### Week 8 – High-throughput workflows\n\n- Automation: pymatgen, custodian, FireWorks, Atomate.\n- Data generation for building surrogate models.\n\n**Exercise:** Perform a small FireWorks workflow (or simulate the idea without cluster resources).\n\n#### Week 9 – Phase stability & the convex hull\n\n- Formation energies, metastability, hull distance.\n- Mapping an entire chemical system.\n\n**Exercise:** Reconstruct phase diagrams from Materials Project data.\n\n### Unit IV — Learning Properties from Atomic Structure (Weeks 10–12)\n\n#### Week 10 – Regression on crystal data\n\n- Predicting bandgaps, hardness, elastic moduli.\n- Comparing different representation families.\n\n**Exercise:** Benchmark random forest, GPR, CGCNN on a small dataset.\n\n#### Week 11 – Machine-learned interatomic potentials\n\n- Overview: GAP, SNAP, MTP, NequIP.\n- Role in simulating defects, diffusion, mechanical behavior.\n\n**Exercise:** Fit a tiny ML potential (ACE or simple SNAP-style) to toy data.\n\n#### Week 12 – Generative models for materials\n\n- VAEs, diffusion models for crystal generation.\n- Constraints: symmetry, stability, charge neutrality.\n\n**Exercise:** Sample a generative model from a pretrained online source; analyze validity.\n\n### Unit V — Mini-Project & Synthesis (Weeks 13–14)\n\n#### Week 13 – Project workshop\n\n**Example projects:**\n\n- Predict bandgap from composition + structure representation.\n- Identify new stable compounds in a chemical system.\n- Build a graph-based model for elastic constants.\n- Use ML to approximate formation energies for a ternary subsystem.\n- Analyze SOAP fingerprints across polymorphs.\n\n#### Week 14 – Presentations & Reflection\n\n- Interpreting models: SHAP for materials descriptors.\n- Strengths/limitations of materials genomics vs experiment-driven ML.\n- How computational and experimental ML meet in modern labs.\n \n## Learning Outcomes\n\nStudents completing this course will be able to:\n\n- Navigate major materials databases and extract relevant structural/property data.\n- Represent crystals numerically using descriptors, fingerprints, and graphs.\n- Train ML models to predict quantum-mechanical and thermodynamic properties.\n- Analyze structural features via symmetry, coordination, and environments.\n- Perform high-throughput screening of materials candidates.\n- Understand and apply generative models for inorganic crystals.\n- Critically evaluate ML results in computational materials discovery.\n\n",
5+
"markdown": "---\ntitle: Materials Genomics\nsubtitle: Computational Materials Discovery\nauthor:\n - name: Philipp Pelz\n corresponding: true\nkeywords:\n - Materials Science\n - Machine Learning\n - Computational Materials Discovery\n - Materials Databases\n - Crystal Structure\nabstract: |\n This course introduces students to materials genomics, treating the periodic table\n and the space of known crystal structures as a searchable, computable design space.\n Students learn how materials databases are built, how atomic structure is represented\n numerically, how structure–property relationships are learned using machine learning,\n and how uncertainty-aware models enable accelerated materials discovery.\ndate: last-modified\nbibliography: references.bib\nnumber-sections: true\njupyter: python3\n---\n\n\n:::: {.columns}\n\n::: {.column width=\"50%\"}\n[![](images/eclipse_header.png){width=300}](https://pelzlab.science)\n:::\n\n::: {.column width=\"25%\"}\n[![](images/github.jpeg){width=120}](https://github.com/ECLIPSE-Lab/MaterialsGenomics)\n\n**Github Code**\n:::\n\n::: {.column width=\"25%\"}\n[![](images/studon.jpeg){width=120}](https://www.studon.fau.de/)\n\n**Studon Link**\n:::\n\n::::\n\n## Course Information\n\n**4th/5th Semester – 5 ECTS · 2h lecture + 2h exercises per week** \n*Coordinated with “Mathematical Foundations of AI & ML” (MFML) and \n“ML for Materials Processing & Characterization” (ML-PC)*\n\n---\n\n## Course Philosophy\n\nMaterials genomics views the periodic table and all known crystal structures as a\n**high-dimensional design space**.\n\nIn this course, students learn to:\n\n- treat materials data as a structured, learnable representation space,\n- move beyond classical descriptors toward learned representations,\n- use ML models as surrogates for quantum-mechanical calculations,\n- reason about uncertainty, stability, and discovery,\n- understand how computational screening integrates with experiments.\n\nThe course explicitly **builds on MFML**:\n\n- PCA and regression are assumed background,\n- neural networks, representation learning, and uncertainty are used, not re-derived.\n\n---\n\n## Week-by-Week Curriculum (14 weeks)\n\n### Unit I — Materials Data as a Design Space (Weeks 1–3)\n\n#### Week 1 – What is Materials Genomics?\n\n- Genomics analogy: genes → functions vs atoms → properties.\n- Structure–property–processing paradigm from a *structure-first* viewpoint.\n- Overview of major databases: Materials Project, OQMD, AFLOW, NOMAD.\n\n**Exercise:** \nExplore Materials Project; query bandgaps, formation energies, symmetries.\n\n---\n\n#### Week 2 – Crystal structures, symmetry, and low-dimensional structure\n\n- Crystal structures as data objects.\n- Space groups, Wyckoff positions, symmetry constraints.\n- PCA as an *exploratory tool* for structural/property data (refresher).\n\n**Exercise:** \nUse pymatgen/spglib to analyze symmetry; visualize PCA of structural features.\n\n---\n\n#### Week 3 – Materials databases & thermodynamic quantities\n\n- File formats: CIF, POSCAR, database schemas.\n- Formation energies, convex hulls, metastability.\n- What databases do *not* contain (bias, incompleteness).\n\n**Exercise:** \nParse CIF files; compute basic structural properties; analyze stability.\n\n---\n\n### Unit II — Representations of Materials (Weeks 4–6)\n\n*(Aligned with early neural networks in MFML)*\n\n#### Week 4 – From classical descriptors to learned representations\n\n- Classical descriptors: Magpie, matminer (composition-based).\n- Limits of hand-crafted features.\n- Why representation learning matters.\n\n**Exercise:** \nBuild a simple property predictor using classical descriptors.\n\n---\n\n#### Week 5 – Graph-based crystal representations\n\n- Crystals as graphs: nodes, edges, periodicity.\n- Intuition behind CGCNN, MEGNet (no architecture deep dive).\n- Relation to MFML neural network concepts.\n\n**Exercise:** \nConstruct a graph representation of crystals; visualize connectivity.\n\n---\n\n#### Week 6 – Local atomic environments\n\n- Local vs global representations.\n- Coordination environments, Voronoi tessellations.\n- SOAP descriptors as a bridge to learned representations.\n\n**Exercise:** \nCompute SOAP vectors; cluster structures in environment space.\n\n---\n\n### Unit III — Learning Structure–Property Relations (Weeks 7–9)\n\n#### Week 7 – Regression and generalization in materials data\n\n- Predicting bandgaps, elastic moduli, formation energies.\n- Bias–variance and overfitting in materials datasets.\n- Dataset size vs model complexity.\n\n**Exercise:** \nCompare linear, random forest, and NN regressors on a materials dataset.\n\n---\n\n#### Week 8 – Neural networks for materials properties\n\n- Neural networks as universal surrogates for DFT-level properties.\n- Training pitfalls: data leakage, imbalance, extrapolation.\n- Physical interpretability concerns.\n\n**Exercise:** \nTrain a small NN for property prediction; analyze overfitting.\n\n---\n\n#### Week 9 – Representation learning and feature discovery\n\n- Learned vs engineered features.\n- What networks “learn” about chemistry and structure.\n- Transferability across chemical systems.\n\n**Exercise:** \nCompare performance using raw descriptors vs learned embeddings.\n\n---\n\n### Unit IV — Latent Spaces, Uncertainty, and Discovery (Weeks 10–12)\n\n#### Week 10 – Latent spaces of materials\n\n- Autoencoders and embeddings for crystal data.\n- Interpreting latent dimensions.\n- Relation to chemical intuition and structure families.\n\n**Exercise:** \nTrain an autoencoder; visualize latent materials space.\n\n---\n\n#### Week 11 – Clustering vs discovery in materials spaces\n\n- Why clustering ≠ discovery.\n- Structure in latent space.\n- Identifying families, outliers, and anomalies.\n\n**Exercise:** \nCompare k-means clustering with latent-space organization.\n\n---\n\n#### Week 12 – Uncertainty-aware discovery & Gaussian Processes\n\n- Aleatoric vs epistemic uncertainty.\n- Gaussian Process regression as a gold standard for uncertainty.\n- Exploration vs exploitation in materials screening.\n- Relevance to materials acceleration platforms.\n\n**Exercise:** \nGP regression vs NN ensembles; visualize uncertainty-driven screening.\n\n---\n\n### Unit V — Constraints, Trust, and Synthesis (Weeks 13–14)\n\n#### Week 13 – Physical constraints and informed learning\n\n- Stability, charge neutrality, symmetry constraints.\n- Physics-informed ML in materials discovery.\n- Failure modes of unconstrained models.\n\n**Exercise:** \nTrain a constrained model using penalty-based approaches.\n\n---\n\n#### Week 14 – Integration, limits, and outlook\n\n- Explainability of materials ML models.\n- What ML can and cannot discover.\n- How computational genomics meets experiment-driven workflows.\n\n**Exercise:** \nMini-project synthesis and presentation.\n\n---\n\n## Learning Outcomes\n\nStudents completing this course will be able to:\n\n- Navigate and interrogate major materials databases.\n- Represent crystal structures using descriptors, graphs, and learned embeddings.\n- Train and evaluate ML models for predicting materials properties.\n- Understand latent spaces and their role in materials discovery.\n- Quantify and interpret uncertainty in materials predictions.\n- Apply ML to accelerate materials screening responsibly.\n- Critically assess the limits of data-driven materials discovery.\n\n",
66
"supporting": [
77
"index_files/figure-docx"
88
],

0 commit comments

Comments
 (0)