Enum: SequenceAlphabet

Types of sequence alphabets used in bioinformatics

URI: valuesets:SequenceAlphabet

Permissible Values

| Value | Meaning | Description |
| --- | --- | --- |
| DNA | None | Deoxyribonucleic acid alphabet (A, T, G, C) |
| RNA | None | Ribonucleic acid alphabet (A, U, G, C) |
| PROTEIN | None | Protein/amino acid alphabet (20 standard AAs) |
| IUPAC_DNA | None | Extended DNA with IUPAC ambiguity codes |
| IUPAC_RNA | None | Extended RNA with IUPAC ambiguity codes |
| IUPAC_PROTEIN | None | Extended protein with ambiguity codes and rare AAs |
| RESTRICTED_DNA | None | Unambiguous DNA bases only (A, T, G, C) |
| RESTRICTED_RNA | None | Unambiguous RNA bases only (A, U, G, C) |
| BINARY | None | Binary encoding of sequences |
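
The descriptions above imply a character set for each nucleotide alphabet. As a rough sketch of how a sequence might be checked against one, the Python below maps a few of the values to character sets. The names `ALPHABET_CHARS` and `fits_alphabet` are hypothetical, and the mapping is an assumption: the enum itself only names the alphabets. The extra IUPAC letters are the standard nucleotide ambiguity codes (R, Y, S, W, K, M, B, D, H, V, N).

```python
# Hypothetical character sets: the enum only names the alphabets,
# so this mapping is an illustration, not part of the schema.
BASE_DNA = "ATGC"
BASE_RNA = "AUGC"
IUPAC_AMBIGUITY = "RYSWKMBDHVN"  # standard IUPAC nucleotide ambiguity codes

ALPHABET_CHARS = {
    "DNA": frozenset(BASE_DNA),
    "RNA": frozenset(BASE_RNA),
    "RESTRICTED_DNA": frozenset(BASE_DNA),
    "RESTRICTED_RNA": frozenset(BASE_RNA),
    "IUPAC_DNA": frozenset(BASE_DNA + IUPAC_AMBIGUITY),
    "IUPAC_RNA": frozenset(BASE_RNA + IUPAC_AMBIGUITY),
}


def fits_alphabet(sequence: str, alphabet: str) -> bool:
    """Return True if every character of `sequence` belongs to `alphabet`.

    Comparison is case-insensitive; an empty sequence trivially fits.
    Raises KeyError for alphabets not covered by this sketch.
    """
    allowed = ALPHABET_CHARS[alphabet]
    return all(ch in allowed for ch in sequence.upper())
```

For example, `fits_alphabet("ACGN", "DNA")` is false because `N` is an ambiguity code, while the same sequence fits `IUPAC_DNA`. PROTEIN, IUPAC_PROTEIN, and BINARY are left out because the page does not spell out their character sets.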

Slots

| Name | Description |
| --- | --- |
| sequence_alphabet | Types of sequence alphabets used in bioinformatics |

Identifier and Mapping Information

Schema Source

  • from schema: https://w3id.org/linkml/valuesets

LinkML Source

name: SequenceAlphabet
description: Types of sequence alphabets used in bioinformatics
title: Sequence Alphabet Types
from_schema: https://w3id.org/linkml/valuesets
rank: 1000
permissible_values:
  DNA:
    text: DNA
    description: Deoxyribonucleic acid alphabet (A, T, G, C)
  RNA:
    text: RNA
    description: Ribonucleic acid alphabet (A, U, G, C)
  PROTEIN:
    text: PROTEIN
    description: Protein/amino acid alphabet (20 standard AAs)
  IUPAC_DNA:
    text: IUPAC_DNA
    description: Extended DNA with IUPAC ambiguity codes
  IUPAC_RNA:
    text: IUPAC_RNA
    description: Extended RNA with IUPAC ambiguity codes
  IUPAC_PROTEIN:
    text: IUPAC_PROTEIN
    description: Extended protein with ambiguity codes and rare AAs
  RESTRICTED_DNA:
    text: RESTRICTED_DNA
    description: Unambiguous DNA bases only (A, T, G, C)
  RESTRICTED_RNA:
    text: RESTRICTED_RNA
    description: Unambiguous RNA bases only (A, U, G, C)
  BINARY:
    text: BINARY
    description: Binary encoding of sequences
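
For consumers that want to work with these values in application code, one option is to mirror the permissible values in a plain Python `Enum`. This is a hand-written sketch, not the output of any LinkML generator, and the helper `parse_alphabet` is a hypothetical name introduced for illustration.

```python
from enum import Enum


class SequenceAlphabet(str, Enum):
    """Hand-written mirror of the permissible values listed above."""

    DNA = "DNA"
    RNA = "RNA"
    PROTEIN = "PROTEIN"
    IUPAC_DNA = "IUPAC_DNA"
    IUPAC_RNA = "IUPAC_RNA"
    IUPAC_PROTEIN = "IUPAC_PROTEIN"
    RESTRICTED_DNA = "RESTRICTED_DNA"
    RESTRICTED_RNA = "RESTRICTED_RNA"
    BINARY = "BINARY"


def parse_alphabet(value: str) -> SequenceAlphabet:
    """Convert a string to a SequenceAlphabet member (exact, case-sensitive match).

    Raises ValueError listing the permissible values when `value` is not one of them.
    """
    try:
        return SequenceAlphabet(value)
    except ValueError:
        allowed = ", ".join(member.value for member in SequenceAlphabet)
        raise ValueError(
            f"{value!r} is not a permissible value; expected one of: {allowed}"
        ) from None
```

Matching is deliberately case-sensitive so that only the exact `text` values from the schema are accepted; a caller that wants to accept `dna` would need to normalize the input first.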