biocantor.io.genbank.constants

GenBank parsing constants. Records common feature types and their relationships as enumerations.

Module Contents

Classes

GenBankParserType

Currently implemented types of GenBank parsing. These types of files are not necessarily mutually exclusive,

MetadataFeatures

GenBank metadata features BioCantor understands.

GeneFeatures

GenBank gene features BioCantor understands.

FeatureCollectionFeatures

GenBank feature collections BioCantor understands.

TranscriptFeatures

GenBank transcript features types BioCantor understands.

NonCodingTranscriptFeatures

Non-coding GenBank transcript features types BioCantor understands.

GeneIntervalFeatures

GenBank interval features types BioCantor understands. These do not match

FeatureIntervalFeatures

GenBank interval features types BioCantor understands. These do not match

KnownQualifiers

GenBank qualifiers that have special meaning

GenbankFlavor

GenBank files are formatted differently at NCBI if the species is prokaryotic or eukaryotic. The main

Attributes

GENBANK_GENE_FEATURES

class biocantor.io.genbank.constants.GenBankParserType

Bases: enum.IntEnum

Currently implemented types of GenBank parsing. These types of files are not necessarily mutually exclusive, but the parsing implementations are different.

SORTED = 1
LOCUS_TAG = 2
HYBRID = 3
class biocantor.io.genbank.constants.MetadataFeatures

Bases: str, inscripta.biocantor.util.enum.HasMemberMixin

GenBank metadata features BioCantor understands.

SOURCE = source
class biocantor.io.genbank.constants.GeneFeatures

Bases: str, inscripta.biocantor.util.enum.HasMemberMixin

GenBank gene features BioCantor understands.

GENE = gene
class biocantor.io.genbank.constants.FeatureCollectionFeatures

Bases: str, inscripta.biocantor.util.enum.HasMemberMixin

GenBank feature collections BioCantor understands.

FEATURE_COLLECTION = misc_feature
class biocantor.io.genbank.constants.TranscriptFeatures

Bases: str, inscripta.biocantor.util.enum.HasMemberMixin

GenBank transcript features types BioCantor understands.

CODING_TRANSCRIPT = mRNA
NONCODING_TRANSCRIPT = ncRNA
TRANSFER_RNA = tRNA
RIBOSOMAL_RNA = rRNA
MISC_RNA = misc_RNA
TM_RNA = tmRNA
class biocantor.io.genbank.constants.NonCodingTranscriptFeatures

Bases: str, inscripta.biocantor.util.enum.HasMemberMixin

Non-coding GenBank transcript features types BioCantor understands.

NONCODING_TRANSCRIPT = ncRNA
TRANSFER_RNA = tRNA
RIBOSOMAL_RNA = rRNA
MISC_RNA = misc_RNA
TM_RNA = tmRNA
class biocantor.io.genbank.constants.GeneIntervalFeatures

Bases: str, inscripta.biocantor.util.enum.HasMemberMixin

GenBank interval features types BioCantor understands. These do not match

BioCantorFeatureTypes because GenBank has length limitations on feature types.

CDS = CDS
EXON = exon
class biocantor.io.genbank.constants.FeatureIntervalFeatures

Bases: str, inscripta.biocantor.util.enum.HasMemberMixin

GenBank interval features types BioCantor understands. These do not match

BioCantorFeatureTypes because GenBank has length limitations on feature types.

FEATURE_INTERVAL = feat_interval
class biocantor.io.genbank.constants.KnownQualifiers

Bases: str, enum.Enum

GenBank qualifiers that have special meaning

GENE = gene
LOCUS_TAG = locus_tag
GENE_ID = gene_id
TRANSCRIPT_ID = transcript_id
PROTEIN_ID = protein_id
PRODUCT = product
GENE_NAME = gene_name
GBKEY = gbkey
DBXREF = db_xref
GENE_SYNONYM = gene_synonym
CODON_START = codon_start
FEATURE_CLASS = feature_class
biocantor.io.genbank.constants.GENBANK_GENE_FEATURES
class biocantor.io.genbank.constants.GenbankFlavor

Bases: enum.Enum

GenBank files are formatted differently at NCBI if the species is prokaryotic or eukaryotic. The main difference is the presence of a transcript level feature on eukaryotic genomes.

PROKARYOTIC = 1
EUKARYOTIC = 2