This view is the central gateway to the sequence data. According to the main philosophy, to provide the protein family analysis of whole genomes, all selected sequences are ordered by species. If specific sequences need to be compared, the corresponding SearchModules need to be adjusted, e.g. only specific protein classes of a subset of species should be selected. Below the species level, sequences are ordered by protein family membership. A survey of the genome for a specific protein is thought to be complete if the corresponding genome sequence is almost finished (assembly with high covarage data available) and we have finished our in-depth analysis not being able to identify further homologs of the protein. The sequence view provides the following information to the sequences by clicking on the corresponding symbols:
|Sequence name composed of the species abbreviation (Aa), the protein abbreviation(Bb), and optional protein classifier (Number) and variant designations (Letter, e.g. C).
|A version number. This number is used to track any changes of the sequence and subsequent analyses data.
|Class designation, if the protein sequences have been classified. Unclassified sequences are refered to as orphans.
|This analysis tool provides numbers to basic analyses like amino acid composition, isoelectric point, extinction coefficients, instability index, and aliphatic index.
|Alternatively used names (maybe obsolete names) to the protein name are given if those have been used in publications or in the NCBI database submission.
|Domain representation of the sequence based on pfam, coiled-coil, and prosite pattern predictions. The schematic presentations are scaled to the longest sequence of the protein selection.
|If the mRNA or genomic DNA sequence has been published by a research group, the corresponding publication is given.
|Gene structure representations, with access to WebScipio for comprehensive descriptions.
|Designation as pseudogene.
|Exclamation mark, if the annotated sequence may contain mispredicted exons.
|Accession numbers as well as the protein sequences.