A database of DEM contact models to aid Reproducible Research and enhance data FAIRness
American University of Armenia, Armenia
Imperial College London, UK
Friday, July 4th, 2025

This presentation is based upon work from COST Action CA22132, supported by COST (European Cooperation in Science and Technology).
COST (European Cooperation in Science and Technology) is a funding agency for research and innovation networks. Our Actions help connect research initiatives across Europe and enable scientists to grow their ideas by sharing them with their peers, boosting their research, career and innovation.


Managing research data is an often forgotten aspect of research.
Traditionally datasets were quite small:
Ad-hoc management no longer sufficient
Many useful datasets have been lost because of poor research data management

For the past two decades there has been significant discussion on the reproducibility crisis in science
Distinction between Reproducibility & Replicability
Reproducibility: Can you get the same answers as me when you analyze my data?
Replicability: Can you get the same answers when you do my experiment and collect your own data?
AI will not make this better!
Dependent on the quality of the data provided to it.

Reproducibility Example
“Here’s the raw data from my simulation. If you read the paper you will see the analysis I did and you should get the same results if you do the exact same analysis.”
Replicability Example
“I ran a DEM simulation of a shear cell. I used a linear contact model with a rolling friction model. Read the paper and you should get a similar flow function if you set-up your simulation exactly the same as me and do the exact same analysis.”
Significant changes in last decade:
The types of data generated have changed
Data management best practices involve the entire data lifecycle from project start to end
Plan: Data Management Plan in grant submissions.
Create: Experiment, Simulation, Survey, Merge, etc…
Document: Describe the data collected in detail. Sooner rather than later!
Use: Analyse/Discover/Collaborate. Document the process.
Preserve: Store for future use. Version Control. Databases & Archives.
Share: Essential for translation of results into knowledge. Open Data? Repositories?
Re-Use: Collaborate / Derive / Develop / Teach / Policy

FAIR is an acronym for Findable, Accessible, Interoperable, Reusable, which are the principles which should apply to scientific data management and guardianship.
Findable: The first part of making data re-useable is to make the data findable. Detailed and accurate metadata is key
Accessible: Data could be openly available or it could require prior authentication and authorisation
Interoperable: Data needs to be able to be used in different programs or workflows
Reusable: Well defined data is essential as it makes it easier to understand and therefore use, combine and/or extend the dataset
The FAIR principles also emphasise machine-actionability
Machine readable and digitally accessible are not the same thing!
What is Machine Actionable?
For example, traditional word processing documents and PDF files are easily read by humans but typically are difficult for machines to interpret.
A machine readable format is a file in a standard computer language (not text) that can be parsed (read) automatically by a web browser or computer system. (e.g.; xml, json).
Some questions to consider about your own simulations:
Guidance on how to assess the “fairness” of your data: https://bit.ly/yourFIP

Any movie fans? You may be familiar with IMDB…

James Bond is an iconic long running series of films



Same original story, differently executed interpretations.
Simulations can be simplified into a set of components that need to be recorded
Each component may have several subcomponents such as material description and the interaction models
Each DEM code has many (different) supported interaction models
Some interaction models may exist in multiple DEM codes

There are two main reasons: Replicability and Interoperability:
There are other reasons:
So how do you track implementations of different models?
Currently no universally accepted Metadata Structure / Ontology for classification of DEM contact models

Databases provide efficient and safe multi-user access to data
Web interface to Relational Database

Initial “alpha” version available at: on-dem-db.onrender.com
Tracking various aspects such as:
Additional requirements were identified:
Image Support (Model schematics)
Need some method of saving & editing submissions
Need to be able to track who has submitted an entry
Entry status (draft/public)
Revision history (who, when, description?)
Easiest method is to provide user accounts with a login
Maintainer / Developer (status & handling multiple)
Some method of interfacing with new common file format being developed
New more robust “production ready” implementation
Django and PostgresqlAdvantages
Django’s model viewsDisadvantages
Colleagues & Collaborators at:

CCC-ParaSolS is a two-year project — January 2025 to December 2026 — funded by the Science and Technology Facilities Council (STFC) to create a Collaborative Computational Community in particulate solids simulations.


If you are in the UK and you do DEM, then you should be part of CCC-ParaSolS!!
