LinkML at a glance#
LinkML is a flexible modeling language that allows you to author schemas (“models”) in YAML that describe the structure of your data. The language is designed to allow for both simple use cases such as describing the column headers in a spreadsheet through to creating a complex interlinked schema.
LinkML is designed to work in harmony with other frameworks, including both semantic RDF-based frameworks, as well as frameworks that are more familiar to developers such as JSON.
Quick links: [ Schemas | Spec | Slides | Notebooks | SchemaSheets | Schema Automator | FAQ: Why LinkML? | GitHub ]
Example schemas: [ PersonInfo | Biolink | LinkML | SSSOM ]
Feature: Rich modeling language#
LinkML offers many features of use to data modelers, while retaining a simple core
Classes can be arranged in inheritance hierarchies
Powerful yet easy to use Semantic enumerations that can optionally be backed by ontologies
Create data models that are independent of a database technology
Develop machine-actionable reporting standards and data dictionaries
Include rich annotations and mappings as part of a model
“Linked Data” ready
All schemas have a corresponding JSON-LD context
Compatibility with RDF tooling, without committing to an RDF stack
Export of data models to OWL Schemas
A bridge between frameworks#
Many frameworks lock you in to a particular view of the world or technology. This can lead to silos, and the need to create mappings and transformations between different representations of the same data; for example, if your JSON documents need to work in concert with your relational database or graph store.
LinkML has many different generators for existing frameworks that allow the translation of a LinkML schema to other frameworks:
Convert to JSON-LD contexts, and instantly port your data to RDF
Convert to JSON-Schema and using JSON-Schema validators
Convert to Python dataclasses or pydantic for easy use within applications
Generate SQL Schemas or SQL Alchemy for use with relational databases
Feature: Generation of documentation and websites#
Using the LinkML toolchain you can go from a schema to a statically hosted searchable website in minutes, with pages for each of your schema elements. Using lightweight namespace registries such as w3id.org you can easily have resolvable URIs for all your concepts.
Showcase:
A rapidly growing toolchain#
LinkML can be thought of as two interlocking parts:
A standard for representing schemas, data dictionaries, standards, and metadata
A reference tool stack for doing things with artefacts that conform to the standard
The core LinkML toolchain is written in Python allows for:
generating downstream schema artefacts, including:
documentation and static sites
code for use by developers (data class in Python, Java, and Typescript, ORMs, enumerations)
conversion between alternate representations like JSON-Schema, SQL DDL, RDF Shapes, Protobuf, …
validation and linting of schemas
data conversion between JSON, TSV, and RDF (where that data conforms to a LinkML schema)
data validation of JSON, TSVs, or RDF using either JSON-Schema, SPARQL, or ShEx
LinkML is part of a growing ecosystem of general purpose tools that make curating, mapping, ingesting, and organizing data much easier
schema-automator bootstraps schemas from existing structured and semi-structured sources
LinkML-OWL allows for generation of complex OWL axioms from datamodels
SchemaSheets converts between spreadsheets and schemas
DataHarmonizer is an ontology-based curation tool that is being adapted to LinkML
We eat our own dogfood!#
The LinkML schema language is itself defined in LinkML, and we use our own toolchain for working with it!
More examples#
See the examples pages