ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!
Search for

UniProt Knowledgebase
Swiss-Prot Protein Knowledgebase
TrEMBL Protein Database

Release notes
UniProtKB release 14.0 of 22-Jul-2008

Content

  Introduction
  UniProtKB/Swiss-Prot Protein Knowledgebase release statistics
  UniProtKB/TrEMBL Protein Database release statistics

  Submissions and Updates
  Download information
  Contact
  Citation

  Related documents: UniProtKB user manual, Recent changes, Forthcoming changes.

Introduction

Release 14.0 of the UniProt Knowledgebase is composed of the UniProtKB/Swiss-Prot Protein Knowledgebase release 56.0 and the UniProtKB/TrEMBL Protein Database release 39.0.

More information on these databases can be found in the user manual What is the UniProt Knowledgebase ?.


UniProtKB/Swiss-Prot protein knowledgebase release 56.0 statistics

Release 56.0 of 22-Jul-08 of UniProtKB/Swiss-Prot contains 392'667 sequence entries, comprising 141'217'034 amino acids abstracted from 172'036 references.

The growth of the database is summarized below.

Release Date Number of entries Number of amino acids
2.0 09/86 3'939 900'163
3.0 11/86 4'160 969'641
4.0 04/87 4'387 1'036'010
5.0 09/87 5'205 1'327'683
6.0 01/88 6'102 1'653'982
7.0 04/88 6'821 1'885'771
8.0 08/88 7'724 2'224'465
9.0 11/88 8'702 2'498'140
10.0 03/89 10'008 2'952'613
11.0 07/89 10'856 3'265'966
12.0 10/89 12'305 3'797'482
13.0 01/90 13'837 4'347'336
14.0 04/90 15'409 4'914'264
15.0 08/90 16'941 5'486'399
16.0 11/90 18'364 5'986'949
17.0 02/91 20'024 6'524'504
18.0 05/91 20'772 6'792'034
19.0 08/91 21'795 7'173'785
20.0 11/91 22'654 7'500'130
21.0 03/92 23'742 7'866'596
22.0 05/92 25'044 8'375'696
23.0 08/92 26'706 9'011'391
24.0 12/92 28'154 9'545'427
25.0 04/93 29'955 10'214'020
26.0 07/93 31'808 10'875'091
27.0 10/93 33'329 11'484'420
28.0 02/94 36'000 12'496'420
29.0 06/94 38'303 13'464'008
30.0 10/94 40'292 14'147'368
31.0 02/95 43'470 15'335'248
32.0 11/95 49'340 17'385'503
33.0 02/96 52'205 18'531'384
34.0 10/96 59'021 21'210'389
35.0 11/97 69'113 25'083'768
36.0 07/98 74'019 26'840'295
37.0 12/98 77'977 28'268'293
38.0 07/99 80'000 29'085'965
39.0 05/00 86'593 31'411'114
40.0 10/01 101'602 37'315'215
41.0 02/03 122'564 44'986'459
42.0 10/03 135'850 50'046'799
43.0 03/04 146'720 54'093'154
44.0 07/04 153'871 56'608'159
45.0 10/04 163'235 59'631'787
46.0 02/05 168'297 61'443'278
47.0 05/05 181'577 65'746'672
48.0 09/05 194'317 70'391'852
49.0 02/06 207'132 75'438'310
50.0 05/06 222'289 81'585'146
51.0 10/06 241'242 88'541'632
52.0 03/07 261'513 95'638'062
53.0 05/07 269'293 98'902'758
54.0 07/07 276'256 101'466'206
55.0 02/08 356'194 127'836'513
56.0 07/08 392'667 141'217'034

In rare cases, UniProtKB/Swiss-Prot entries are removed. Deleted entries are almost exclusively Open Reading Frames (ORFs) that have been wrongly predicted to code for proteins. When there is enough evidence that these hypothetical proteins are not real we take the decision to remove them from UniProtKB/Swiss-Prot. In the document delac_sp.txt, you will find a list of all accession numbers which were previously present in UniProtKB/Swiss-Prot, but which have now been deleted from the database.


Status of the model organisms

We have selected a number of organisms that are the target of genome sequencing and/or mapping projects and for which we intend to:

From our efforts to annotate human sequence entries as completely as possible arose the HPI project, and the bacterial model organisms became the focus of the HAMAP project. Here is the current status of the model organisms which are not covered by these two projects:

Organism Database cross-references Index file Number of sequences
A.thaliana TAIR arath.txt 6'914
C.albicans None yet calbican.txt 727
C.elegans Wormpep celegans.txt 3188
D.discoideum DictyBase dicty.txt 2'479
D.melanogaster FlyBase fly.txt 2'817
M.musculus MGD mgdtosp.txt 15'813
S.cerevisiae SGD yeast.txt 6'553
S.pombe GeneDB_SPombe pombe.txt 4'421

UniProtKB/Swiss-Prot release statistics
1.  INTRODUCTION

Release 56.0 of 22-Jul-08 of UniProtKB/Swiss-Prot contains 392667 sequence entries,
comprising 141217034 amino acids abstracted from 172036 references. 

36631 sequences have been added since release 55.0, the sequence data of
605 existing entries has been updated and the annotations of
356036 entries have been revised.

Number of fragments: 8097
Number of additional sequences produced by alternative splicing, initiation or promoter usage, or ribosomal frameshifting: 26036


Protein existence:
PE 1: Evidence at protein level    60013 entries
PE 2: Evidence at transcript level 63043 entries
PE 3: Inferred from homology       255230 entries
PE 4: Predicted                    13153 entries
PE 5: Uncertain                    1228 entries


2.  AMINO ACID COMPOSITION

   2.1  Composition in percent for the complete database

   Ala (A) 8.13   Gln (Q) 3.95   Leu (L) 9.67   Ser (S) 6.67
   Arg (R) 5.50   Glu (E) 6.73   Lys (K) 5.88   Thr (T) 5.35
   Asn (N) 4.05   Gly (G) 7.04   Met (M) 2.41   Trp (W) 1.09
   Asp (D) 5.40   His (H) 2.28   Phe (F) 3.88   Tyr (Y) 2.93
   Cys (C) 1.42   Ile (I) 5.92   Pro (P) 4.77   Val (V) 6.82

   Asx (B) 0.000  Glx (Z) 0.000  Xaa (X) 0.00


   2.2  Classification of the amino acids by their frequency

   Phe, Tyr, Met, His, Cys, Trpla, Gly, Val, Glu, Ser, Ile, Lys, Arg, Asp, Thr, Pro, Asn, Gln,
   Phe, Tyr, Met, His, Cys, Trp


3.  TAXONOMIC ORIGIN

   Total number of species represented in this release of UniProtKB/Swiss-Prot: 11471

   The first twenty species represent 98378 sequences: 25.1 % of the total
   number of entries.


   3.1 Table of the frequency of occurrence of species

        Species represented 1x: 5236
                            2x: 1694
                            3x:  835
                            4x:  548
                            5x:  419
                            6x:  320
                            7x:  232
                            8x:  193
                            9x:  169
                           10x:  107
                       11- 20x:  516
                       21- 50x:  351
                       51-100x:  139
                         >100x:  712


   3.2  Table of the most represented species

  ------  ---------  --------------------------------------------
  Number  Frequency  Species
  ------  ---------  --------------------------------------------
       1      20069  Homo sapiens (Human)
       2      15813  Mus musculus (Mouse)
       3       7122  Rattus norvegicus (Rat)
       4       6914  Arabidopsis thaliana (Mouse-ear cress)
       5       6553  Saccharomyces cerevisiae (Baker's yeast)
       6       5371  Bos taurus (Bovine)
       7       4421  Schizosaccharomyces pombe (Fission yeast)
       8       4342  Escherichia coli (strain K12)
       9       3188  Caenorhabditis elegans
      10       2878  Bacillus subtilis
      11       2817  Drosophila melanogaster (Fruit fly)
      12       2816  Xenopus laevis (African clawed frog)
      13       2479  Dictyostelium discoideum (Slime mold)
      14       2194  Danio rerio (Zebrafish) (Brachydanio rerio)
      15       2125  Pongo abelii (Sumatran orangutan)
      16       2054  Gallus gallus (Chicken)
      17       1950  Escherichia coli O157:H7
      18       1782  Methanocaldococcus jannaschii (Methanococcus jannaschii)
      19       1774  Haemophilus influenzae
      20       1716  Oryza sativa subsp. japonica (Rice)
      21       1700  Salmonella typhimurium
      22       1627  Escherichia coli O6
      23       1625  Shigella flexneri
      24       1445  Mycobacterium tuberculosis
      25       1323  Sus scrofa (Pig)
      26       1292  Salmonella typhi
      27       1241  Pseudomonas aeruginosa
      28       1187  Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
      29       1183  Mycobacterium bovis
      30       1121  Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey)
      31        990  Synechocystis sp. (strain PCC 6803)
      32        981  Archaeoglobus fulgidus
      33        953  Yersinia pestis
      34        912  Vibrio cholerae
      35        909  Acanthamoeba polyphaga mimivirus (APMV)
      36        888  Rhizobium meliloti (Sinorhizobium meliloti)
      37        873  Oryctolagus cuniculus (Rabbit)
      38        866  Salmonella paratyphi A
      39        864  Staphylococcus aureus (strain Mu50 / ATCC 700699)
      40        863  Staphylococcus aureus (strain N315)
      41        835  Staphylococcus aureus (strain MW2)
      42        835  Staphylococcus aureus (strain COL)
      43        831  Staphylococcus aureus (strain MSSA476)
      44        828  Staphylococcus aureus (strain MRSA252)
      45        814  Salmonella choleraesuis
      46        809  Yersinia pseudotuberculosis
      47        808  Escherichia coli O6:K15:H31 (strain 536 / UPEC)
      48        808  Shigella sonnei (strain Ss046)
      49        765  Shigella boydii serotype 4 (strain Sb227)
      50        764  Vibrio parahaemolyticus
      51        763  Ashbya gossypii (Yeast) (Eremothecium gossypii)
      52        759  Aquifex aeolicus
      53        754  Pasteurella multocida
      54        748  Shigella dysenteriae serotype 1 (strain Sd197)
      55        747  Escherichia coli O9:H4 (strain HS)
      56        747  Canis familiaris (Dog)
      57        744  Escherichia coli (strain UTI89 / UPEC)
      58        743  Escherichia coli O139:H28 (strain E24377A / ETEC)
      59        736  Kluyveromyces lactis (Yeast) (Candida sphaerica)
      60        727  Candida albicans (Yeast)
      61        724  Erwinia carotovora subsp. atroseptica (Pectobacterium atrosepticum)
      62        717  Neurospora crassa
      63        711  Escherichia coli (strain ATCC 8739 / DSM 1576 / Crooks)
      64        707  Streptomyces coelicolor
      65        705  Vibrio vulnificus
      66        700  Staphylococcus epidermidis (strain ATCC 35984 / RP62A)
      67        699  Staphylococcus epidermidis (strain ATCC 12228)
      68        694  Candida glabrata (Yeast) (Torulopsis glabrata)
      69        692  Photorhabdus luminescens subsp. laumondii
      70        689  Bacillus halodurans
      71        688  Vibrio vulnificus (strain YJ016)
      72        687  Mycoplasma pneumoniae
      73        685  Shigella flexneri serotype 5b (strain 8401)
      74        671  Pan troglodytes (Chimpanzee)
      75        665  Bacillus anthracis
      76        655  Yersinia pestis bv. Antiqua (strain Nepal516)
      77        654  Anabaena sp. (strain PCC 7120)
      78        650  Yersinia enterocolitica serotype O:8 / biotype 1B (strain 8081)
      79        649  Yersinia pestis bv. Antiqua (strain Antiqua)
      80        647  Mycobacterium leprae
      81        639  Pseudomonas syringae pv. tomato
      82        637  Pseudomonas putida (strain KT2440)
      83        636  Yersinia pseudotuberculosis serotype O:1b (strain IP 31758)
      84        630  Escherichia coli O1:K1 / APEC
      85        627  Staphylococcus aureus (strain NCTC 8325)
      86        620  Escherichia coli
      87        618  Salmonella paratyphi B (strain ATCC BAA-1250 / SPB7)
      88        617  Bradyrhizobium japonicum
      89        613  Treponema pallidum
      90        612  Enterobacter sp. (strain 638)
      91        609  Zea mays (Maize)
      92        599  Klebsiella pneumoniae subsp. pneumoniae (strain ATCC 700721 / MGH 78578)
      93        598  Yersinia pestis (strain Pestoides F)
      94        595  Methanobacterium thermoautotrophicum
      95        592  Bacillus cereus (strain ATCC 14579 / DSM 31)
      96        592  Agrobacterium tumefaciens (strain C58 / ATCC 33970)
      97        589  Citrobacter koseri (strain ATCC BAA-895 / CDC 4225-83 / SGSC4696)
      98        586  Ralstonia solanacearum (Pseudomonas solanacearum)
      99        581  Shewanella oneidensis
     100        581  Rickettsia prowazekii
     101        580  Staphylococcus aureus (strain USA300)
     102        579  Helicobacter pylori (Campylobacter pylori)
     103        578  Rhizobium loti (Mesorhizobium loti)
     104        575  Serratia proteamaculans (strain 568)
     105        572  Buchnera aphidicola subsp. Acyrthosiphon pisum 
     106        569  Listeria monocytogenes
     107        567  Staphylococcus aureus (strain bovine RF122 / ET3-1)
     108        566  Lactococcus lactis subsp. lactis (Streptococcus lactis)
     109        562  Buchnera aphidicola subsp. Schizaphis graminum
     110        561  Listeria innocua
     111        560  Photobacterium profundum (Photobacterium sp. (strain SS9))
     112        560  Helicobacter pylori J99 (Campylobacter pylori J99)
     113        559  Neisseria meningitidis serogroup B
     114        556  Xanthomonas campestris pv. campestris
     115        554  Salmonella arizonae (strain ATCC BAA-731 / CDC346-86 / RSK2980)
     116        546  Staphylococcus haemolyticus (strain JCSC1435)
     117        541  Staphylococcus saprophyticus subsp. saprophyticus 
     118        540  Neisseria meningitidis serogroup A
     119        538  Brucella melitensis
     120        535  Brucella suis
     121        534  Bacillus cereus (strain ATCC 10987)
     122        532  Yarrowia lipolytica (Candida lipolytica)
     123        531  Clostridium acetobutylicum
     124        529  Enterobacter sakazakii (strain ATCC BAA-894)
     125        528  Caulobacter crescentus (Caulobacter vibrioides)
     126        521  Emericella nidulans (Aspergillus nidulans)
     127        521  Debaryomyces hansenii (Yeast) (Torulaspora hansenii)
     128        521  Xanthomonas axonopodis pv. citri
     129        515  Oceanobacillus iheyensis
     130        514  Bacillus thuringiensis subsp. konkukian
     131        509  Pseudomonas syringae pv. syringae (strain B728a)
     132        507  Buchnera aphidicola subsp. Baizongia pistaciae
     133        507  Streptococcus pneumoniae
     134        504  Vibrio fischeri (strain ATCC 700601 / ES114)
     135        503  Pseudomonas fluorescens (strain PfO-1)
     136        502  Bacillus cereus (strain ZK / E33L)
     137        502  Listeria monocytogenes serotype 4b (strain F2365)
     138        501  Pseudomonas aeruginosa (strain UCBPP-PA14)
     139        499  Xylella fastidiosa
     140        498  Pseudomonas fluorescens (strain Pf-5 / ATCC BAA-477)
     141        497  Thermotoga maritima
     142        493  Bacillus licheniformis (strain DSM 13 / ATCC 14580)
     143        493  Bordetella bronchiseptica (Alcaligenes bronchisepticus)
     144        491  Rickettsia conorii
     145        490  Xylella fastidiosa (strain Temecula1 / ATCC 700964)
     146        488  Pseudomonas syringae pv. phaseolicola (strain 1448A / Race 6)
     147        483  Mycoplasma genitalium
     148        481  Bordetella parapertussis
     149        481  Chromobacterium violaceum
     150        481  Haemophilus ducreyi
     151        480  Bordetella pertussis
     152        478  Deinococcus radiodurans
     153        475  Sodalis glossinidius (strain morsitans)
     154        473  Clostridium perfringens
     155        470  Corynebacterium glutamicum (Brevibacterium flavum)
     156        467  Vibrio cholerae serotype O1 (strain ATCC 39541 / Ogawa 395 / O395)
     157        464  Methanosarcina acetivorans
     158        461  Brucella abortus
     159        458  Haemophilus influenzae (strain 86-028NP)
     160        456  Pyrococcus horikoshii
     161        456  Mannheimia succiniciproducens (strain MBEL55E)
     162        455  Pseudomonas entomophila (strain L48)
     163        452  Pyrococcus abyssi
     164        452  Streptomyces avermitilis
     165        451  Xanthomonas campestris pv. campestris (strain 8004)
     166        450  Burkholderia pseudomallei (Pseudomonas pseudomallei)
     167        448  Pseudomonas aeruginosa (strain PA7)
     168        448  Enterococcus faecalis (Streptococcus faecalis)
     169        448  Halobacterium salinarium (Halobacterium halobium)
     170        447  Bacillus clausii (strain KSM-K16)
     171        446  Rickettsia felis (Rickettsia azadi)
     172        444  Streptococcus pneumoniae (strain ATCC BAA-255 / R6)
     173        444  Methanosarcina mazei (Methanosarcina frisia)
     174        442  Shewanella sp. (strain MR-7)
     175        441  Synechococcus elongatus (Thermosynechococcus elongatus)
     176        441  Geobacillus kaustophilus
     177        440  Lactobacillus plantarum
     178        440  Vibrio harveyi (strain ATCC BAA-1116 / BB120)
     179        439  Shewanella sp. (strain MR-4)
     180        436  Streptococcus mutans
     181        436  Chlamydia trachomatis
     182        434  Thermoanaerobacter tengcongensis
     183        434  Oryza sativa subsp. indica (Rice)
     184        433  Rickettsia bellii (strain RML369-C)
     185        433  Pyrococcus furiosus
     186        432  Ovis aries (Sheep)
     187        432  Synechococcus elongatus (strain PCC 7942) (Anacystis nidulans R2)
     188        430  Brucella abortus (strain 2308)
     189        429  Streptococcus pyogenes serotype M6
     190        428  Acinetobacter sp. (strain ADP1)
     191        427  Borrelia burgdorferi (Lyme disease spirochete)
     192        427  Burkholderia mallei (Pseudomonas mallei)
     193        427  Nicotiana tabacum (Common tobacco)
     194        426  Rhodopseudomonas palustris
     195        424  Anabaena variabilis (strain ATCC 29413 / PCC 7937)
     196        423  Burkholderia sp. (strain 383) (Burkholderia cepacia 
     197        422  Campylobacter jejuni
     198        421  Xanthomonas campestris pv. vesicatoria (strain 85-10)
     199        420  Pseudomonas putida (strain F1 / ATCC 700007)
     200        419  Chlamydia pneumoniae (Chlamydophila pneumoniae)
     201        416  Ralstonia eutropha (strain JMP134) (Alcaligenes eutrophus)
     202        414  Staphylococcus aureus (strain Newman)
     203        414  Shewanella frigidimarina (strain NCIMB 400)
     204        414  Aspergillus fumigatus (Sartorya fumigata)
     205        413  Shewanella sp. (strain ANA-3)
     206        412  Xanthomonas oryzae pv. oryzae (strain MAFF 311018)
     207        412  Pseudomonas putida (strain GB-1)
     208        410  Methylococcus capsulatus
     209        409  Chlamydia muridarum
     210        409  Streptococcus pyogenes serotype M1
     211        408  Rhizobium sp. (strain NGR234)
     212        408  Ralstonia eutropha  (Cupriavidus necator 
     213        407  Sulfolobus solfataricus
     214        405  Rhodobacter sphaeroides (strain ATCC 17023 / 2.4.1 / NCIB 8253 / DSM 158)
     215        405  Streptococcus pyogenes serotype M18
     216        403  Rickettsia typhi
     217        403  Streptococcus pyogenes serotype M3
     218        402  Bacillus amyloliquefaciens (strain FZB42)
     219        400  Shewanella baltica (strain OS185)
     220        400  Nitrosomonas europaea
     221        398  Gloeobacter violaceus
     222        398  Staphylococcus aureus (strain Mu3 / ATCC 700698)
     223        397  Hahella chejuensis (strain KCTC 2396)
     224        397  Solanum lycopersicum (Tomato) (Lycopersicon esculentum)
     225        395  Aeromonas hydrophila subsp. hydrophila (strain ATCC 7966 / NCIB 9240)
     226        395  Pseudoalteromonas haloplanktis (strain TAC 125)
     227        393  Corynebacterium efficiens
     228        392  Dechloromonas aromatica (strain RCB)
     229        389  Neisseria gonorrhoeae (strain ATCC 700825 / FA 1090)
     230        389  Chlorobium tepidum
     231        389  Shewanella sp. (strain W3-18-1)
     232        389  Colwellia psychrerythraea (strain 34H / ATCC BAA-681) (Vibrio psychroerythus)
     233        388  Shewanella putrefaciens (strain CN-32 / ATCC BAA-453)
     234        387  Burkholderia xenovorans (strain LB400)
     235        385  Pseudomonas mendocina (strain ymp)
     236        385  Synechococcus sp. (strain ATCC 27144 / PCC 6301 / SAUG 1402/1) 
     237        384  Mycobacterium paratuberculosis
     238        384  Idiomarina loihiensis
     239        382  Shewanella denitrificans (strain OS217 / ATCC BAA-1090 / DSM 15013)
     240        382  Shewanella baltica (strain OS195)
     241        381  Haemophilus influenzae (strain PittEE)
     242        381  Synechococcus sp. (strain WH8102)
     243        381  Pyrococcus kodakaraensis (Thermococcus kodakaraensis)
     244        380  Burkholderia thailandensis (strain E264 / ATCC 700388 / DSM 13276 / CIP 106301)
     245        380  Aeromonas salmonicida (strain A449)
     246        379  Shewanella baltica (strain OS155 / ATCC BAA-1091)
     247        377  Actinobacillus pleuropneumoniae serotype 5b (strain L20)
     248        374  Solanum tuberosum (Potato)
     249        374  Shewanella amazonensis (strain ATCC BAA-1098 / SB2B)
     250        374  Burkholderia cenocepacia (strain AU 1054)
     251        372  Prochlorococcus marinus (strain MIT 9313)
     252        372  Azoarcus sp. (strain EbN1) (Aromatoleum aromaticum (strain EbN1))
     253        372  Streptococcus agalactiae serotype III
     254        371  Burkholderia pseudomallei (strain 1710b)
     255        370  Xanthomonas oryzae pv. oryzae
     256        369  Staphylococcus aureus (strain JH1)
     257        369  Shewanella loihica (strain ATCC BAA-1088 / PV-4)
     258        368  Streptococcus agalactiae serotype V
     259        368  Coxiella burnetii
     260        367  Methanopyrus kandleri
     261        367  Listeria welshimeri serovar 6b (strain ATCC 35897 / DSM 20650 / SLCC5334)
     262        365  Rhizobium etli (strain CFN 42 / ATCC 51251)
     263        365  Bacillus cereus subsp. cytotoxis (strain NVH 391-98)
     264        364  Prochlorococcus marinus
     265        363  Staphylococcus aureus (strain JH9)
     266        363  Leptospira interrogans
     267        363  Geobacter sulfurreducens
     268        357  Aeropyrum pernix
     269        356  Haemophilus somnus (strain 129Pt) (Histophilus somni (strain 129Pt))
     270        356  Nitrosococcus oceani (strain ATCC 19707 / NCIMB 11848)
     271        355  Haemophilus influenzae (strain PittGG)
     272        353  Leptospira interrogans serogroup Icterohaemorrhagiae serovar copenhageni
     273        352  Burkholderia cenocepacia (strain HI2424)
     274        352  Shewanella halifaxensis (strain HAW-EB4)
     275        352  Thermus thermophilus (strain HB8 / ATCC 27634 / DSM 579)
     276        351  Ralstonia metallidurans (strain CH34 / ATCC 43123 / DSM 2839)
     277        351  Rhizobium leguminosarum bv. viciae (strain 3841)
     278        351  Pisum sativum (Garden pea)
     279        349  Legionella pneumophila (strain Paris)
     280        348  Bacillus pumilus (strain SAFR-032)
     281        348  Legionella pneumophila (strain Lens)
     282        348  Chromohalobacter salexigens (strain DSM 3043 / ATCC BAA-138 / NCIMB 13768)
     283        347  Sulfolobus tokodaii
     284        346  Actinobacillus succinogenes (strain ATCC 55618 / 130Z)
     285        345  Thiobacillus denitrificans (strain ATCC 25259)
     286        345  Nocardia farcinica
     287        345  Psychromonas ingrahamii (strain 37)
     288        345  Shewanella pealeana (strain ATCC 700345 / ANG-SQ1)
     289        345  Prochlorococcus marinus subsp. pastoris (strain CCMP1378 / MED4)
     290        343  Glycine max (Soybean)
     291        342  Mycobacterium tuberculosis (strain ATCC 25177 / H37Ra)
     292        342  Neisseria meningitidis serogroup C / serotype 2a (strain ATCC 700532 / FAM18)
     293        342  Legionella pneumophila subsp. pneumophila 
     294        340  Saccharophagus degradans (strain 2-40 / ATCC 43961 / DSM 17024)
     295        339  Silicibacter pomeroyi
     296        339  Desulfovibrio vulgaris (strain Hildenborough / ATCC 29579 / NCIMB 8303)
     297        339  Burkholderia ambifaria (strain ATCC BAA-244 / AMMD) (Burkholderia cepacia 
     298        338  Pseudoalteromonas atlantica (strain T6c / BAA-1087)
     299        338  Shewanella sediminis (strain HAW-EB3)
     300        336  Macaca mulatta (Rhesus macaque)
     301        332  Geobacillus thermodenitrificans (strain NG80-2)
     302        331  Staphylococcus aureus (strain USA300 / TCH1516)
     303        331  Caenorhabditis briggsae
     304        331  Rhodopirellula baltica
     305        330  Mycobacterium bovis (strain BCG / Pasteur 1173P2)
     306        329  Burkholderia vietnamiensis (strain G4 / LMG 22486) (Burkholderia cepacia 
     307        329  Lactococcus lactis subsp. cremoris (strain MG1363)
     308        329  Nitrosospira multiformis (strain ATCC 25196 / NCIMB 11849)
     309        329  Bordetella avium (strain 197N)
     310        328  Pseudomonas stutzeri (strain A1501)
     311        328  Rhodoferax ferrireducens (strain DSM 15236 / ATCC BAA-621 / T118)
     312        327  Symbiobacterium thermophilum
     313        326  Zymomonas mobilis
     314        326  Fusobacterium nucleatum subsp. nucleatum
     315        324  Burkholderia pseudomallei (strain 1106a)
     316        322  Clostridium perfringens (strain ATCC 13124 / NCTC 8237 / Type A)
     317        322  Thermoplasma acidophilum
     318        321  Thermus thermophilus (strain HB27 / ATCC BAA-163 / DSM 7039)
     319        321  Wolinella succinogenes
     320        321  Methanococcus maripaludis
     321        321  Rhodospirillum rubrum (strain ATCC 11170 / NCIB 8255)
     322        320  Alcanivorax borkumensis (strain SK2 / ATCC 700651 / DSM 11573)
     323        319  Bacillus thuringiensis (strain Al Hakam)
     324        319  Methylobacillus flagellatus (strain KT / ATCC 51484 / DSM 6875)
     325        319  Geobacter metallireducens (strain GS-15 / ATCC 53774 / DSM 7210)
     326        318  Triticum aestivum (Wheat)
     327        318  Streptococcus agalactiae serotype Ia
     328        318  Bacteroides thetaiotaomicron
     329        317  Rhodopseudomonas palustris (strain HaA2)
     330        316  Corynebacterium diphtheriae
     331        316  Pelobacter carbinolicus (strain DSM 2380 / Gra Bd 1)
     332        315  Burkholderia pseudomallei (strain 668)
     333        315  Rhodopseudomonas palustris (strain BisB18)
     334        315  Sinorhizobium medicae (strain WSM419) (Ensifer medicae)
     335        315  Azoarcus sp. (strain BH72)
     336        314  Marinobacter aquaeolei  (Marinobacter hydrocarbonoclasticus 
     337        313  Clostridium tetani
     338        313  Burkholderia mallei (strain NCTC 10247)
     339        312  Methanosarcina barkeri (strain Fusaro / DSM 804)
     340        312  Brucella canis (strain ATCC 23365 / NCTC 10854)
     341        312  Brucella suis (strain ATCC 23445 / NCTC 10510)
     342        311  Hordeum vulgare (Barley)
     343        311  Campylobacter jejuni (strain RM1221)
     344        311  Nitrobacter winogradskyi (strain Nb-255 / ATCC 25391)
     345        310  Thiomicrospira crunogena (strain XCL-2)
     346        309  Streptococcus pneumoniae serotype 2 (strain D39 / NCTC 7466)
     347        309  Alkalilimnicola ehrlichei (strain MLHE-1)
     348        308  Burkholderia mallei (strain NCTC 10229)
     349        308  Prochlorococcus marinus (strain NATL2A)
     350        305  Clostridium perfringens (strain SM101 / Type A)
     351        305  Ochrobactrum anthropi (strain ATCC 49188 / DSM 6882 / NCTC 12168)
     352        304  Sulfolobus acidocaldarius
     353        304  Rhodopseudomonas palustris (strain BisB5)
     354        303  Carboxydothermus hydrogenoformans (strain Z-2901 / DSM 6008)
     355        302  Haloarcula marismortui (Halobacterium marismortui)
     356        302  Bacteroides fragilis
     357        301  Nitrobacter hamburgensis (strain X14 / DSM 10229)
     358        300  Burkholderia mallei (strain SAVP1)
     359        300  Gluconobacter oxydans (Gluconobacter suboxydans)
     360        300  Mesorhizobium sp. (strain BNC1)
     361        300  Streptococcus thermophilus (strain CNRZ 1066)
     362        298  Roseobacter denitrificans (strain ATCC 33942 / OCh 114) (Erythrobacter sp.  
     363        298  Streptococcus thermophilus (strain ATCC BAA-250 / LMG 18311)
     364        297  Synechococcus sp. (strain CC9902)
     365        297  Cryptococcus neoformans (Filobasidiella neoformans)
     366        297  Prochlorococcus marinus (strain MIT 9312)
     367        295  Staphylococcus aureus
     368        295  Bartonella henselae (Rochalimaea henselae)
     369        295  Psychrobacter arcticus (strain DSM 17307 / 273-4)
     370        294  Pyrobaculum aerophilum
     371        294  Nitrosomonas eutropha (strain C91)
     372        293  Cavia porcellus (Guinea pig)
     373        293  Helicobacter hepaticus
     374        291  Lactococcus lactis subsp. cremoris (strain SK11)
     375        290  Streptococcus sanguinis (strain SK36)
     376        290  Desulfotalea psychrophila
     377        289  Streptococcus gordonii (strain Challis / ATCC 35105 / CH1 / DL1 / V288)
     378        289  Legionella pneumophila (strain Corby)
     379        289  Synechococcus sp. (strain JA-3-3Ab) 
     380        289  Thermoplasma volcanium
     381        289  Bartonella quintana (Rochalimaea quintana)
     382        288  Synechococcus sp. (strain CC9605)
     383        288  Synechococcus sp. (strain JA-2-3B'a(2-13)) 
     384        287  Moorella thermoacetica (strain ATCC 39073)
     385        286  Brucella ovis (strain ATCC 25840 / 63/290 / NCTC 10512)
     386        286  Streptococcus pyogenes serotype M28
     387        286  Psychrobacter cryohalolentis (strain K5)
     388        286  Halorhodospira halophila (strain DSM 244 / SL1) (Ectothiorhodospira halophila 
     389        285  Pseudomonas putida
     390        284  Jannaschia sp. (strain CCS1)
     391        284  Streptococcus pyogenes serotype M5 (strain Manfredo)
     392        282  Rhodopseudomonas palustris (strain BisA53)
     393        282  Haemophilus somnus (strain 2336) (Histophilus somni (strain 2336))
     394        282  Lactobacillus sakei subsp. sakei (strain 23K)
     395        281  Rhodobacter sphaeroides (strain ATCC 17029 / ATH 2.4.9)
     396        280  Trichodesmium erythraeum (strain IMS101)
     397        280  Silicibacter sp. (strain TM1040)
     398        280  Bifidobacterium longum
     399        279  Ustilago maydis (Smut fungus)
     400        279  Streptococcus thermophilus (strain ATCC BAA-491 / LMD-9)
     401        279  Wigglesworthia glossinidia brevipalpis
     402        278  Spinacia oleracea (Spinach)
     403        277  Campylobacter jejuni subsp. jejuni serotype O:23/36 (strain 81-176)
     404        277  Bradyrhizobium sp. (strain BTAi1 / ATCC BAA-1182)
     405        276  Lactobacillus johnsonii
     406        275  Campylobacter jejuni subsp. jejuni serotype O:6 (strain 81116 / NCTC 11828)
     407        275  Porphyromonas gingivalis (Bacteroides gingivalis)
     408        274  Equus caballus (Horse)
     409        274  Propionibacterium acnes
     410        272  Gorilla gorilla gorilla (Lowland gorilla)
     411        272  Polaromonas sp. (strain JS666 / ATCC BAA-500)
     412        272  Leifsonia xyli subsp. xyli
     413        270  Bacteroides fragilis (strain ATCC 25285 / NCTC 9343)
     414        269  Francisella tularensis subsp. tularensis
     415        269  Bradyrhizobium sp. (strain ORS278)
     416        269  Clostridium botulinum (strain Langeland / NCTC 10281 / Type F)
     417        269  Aspergillus oryzae
     418        268  Blochmannia floridanus
     419        268  Rhodococcus sp. (strain RHA1)
     420        268  Bacteriophage T4
     421        268  Desulfovibrio desulfuricans (strain G20)
     422        268  Acidovorax avenae subsp. citrulli (strain AAC00-1)
     423        267  Helicobacter pylori (strain HPAG1)
     424        267  Anaeromyxobacter dehalogenans (strain 2CP-C)
     425        266  Magnetospirillum magneticum (strain AMB-1 / ATCC 700264)
     426        265  Lactobacillus acidophilus
     427        265  Clostridium novyi (strain NT)
     428        264  Janthinobacterium sp. (strain Marseille) (Minibacterium massiliensis)
     429        264  Mycobacterium ulcerans (strain Agy99)
     430        264  Chlorobium chlorochromatii (strain CaD3)
     431        263  Ureaplasma parvum (Ureaplasma urealyticum biotype 1)
     432        263  Neisseria meningitidis serogroup C (strain 053442)
     433        262  Rhodobacter capsulatus (Rhodopseudomonas capsulata)
     434        262  Paracoccus denitrificans (strain Pd 1222)
     435        262  Streptococcus pyogenes serotype M12 (strain MGAS9429)
     436        261  Streptococcus pyogenes serotype M4 (strain MGAS10750)
     437        260  Corynebacterium glutamicum (strain R)
     438        260  Desulfitobacterium hafniense (strain Y51)
     439        260  Chlamydophila caviae
     440        258  Streptococcus pyogenes serotype M2 (strain MGAS10270)
     441        258  Polaromonas naphthalenivorans (strain CJ2)
     442        257  Myxococcus xanthus (strain DK 1622)
     443        257  Clostridium beijerinckii (strain ATCC 51743 / NCIMB 8052) 
     444        257  Francisella tularensis subsp. holarctica (strain LVS)
     445        257  Prochlorococcus marinus (strain MIT 9301)
     446        257  Mycobacterium smegmatis (strain ATCC 700084 / mc(2)155)
     447        257  Synechococcus sp. (strain CC9311)
     448        256  Thermotoga petrophila (strain RKU-1 / ATCC BAA-488 / DSM 13995)
     449        256  Herminiimonas arsenicoxydans
     450        256  Pelodictyon luteolum (strain DSM 273) (Chlorobium luteolum (strain DSM 273))
     451        255  Acidovorax sp. (strain JS42)
     452        255  Clostridium thermocellum (strain ATCC 27405 / DSM 1237)
     453        255  Prochlorococcus marinus (strain MIT 9515)
     454        255  Synechococcus sp. (strain WH7803)
     455        255  Mycobacterium avium (strain 104)
     456        254  Clostridium botulinum (strain ATCC 19397 / Type A)
     457        254  Vaccinia virus (strain Copenhagen) (VACV)
     458        253  Thermobifida fusca (strain YX)
     459        253  Corynebacterium jeikeium (strain K411)
     460        253  Novosphingobium aromaticivorans (strain DSM 12444)
     461        252  Prochlorococcus marinus (strain AS9601)
     462        252  Mycobacterium vanbaalenii (strain DSM 7251 / PYR-1)
     463        251  Mycobacterium sp. (strain MCS)
     464        250  Lactobacillus salivarius subsp. salivarius (strain UCC118)
     465        250  Bdellovibrio bacteriovorus
     466        249  Rhodobacter sphaeroides (strain ATCC 17025 / ATH 2.4.3)
     467        248  Methylibium petroleiphilum (strain PM1)
     468        248  Clostridium kluyveri (strain ATCC 8527 / DSM 555 / NCIMB 10680)
     469        248  Campylobacter jejuni subsp. doylei (strain ATCC BAA-1458 / RM4099 / 269.97)
     470        247  Alkaliphilus metalliredigens (strain QYMF)
     471        246  Blochmannia pennsylvanicus (strain BPEN)
     472        246  Prochlorococcus marinus (strain NATL1A)
     473        246  Marinomonas sp. (strain MWYL1)
     474        245  Prochlorococcus marinus (strain MIT 9215)
     475        245  Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / ORS 571)
     476        244  Coxiella burnetii (strain Dugway 5J108-111)
     477        244  Sulfurimonas denitrificans  (Thiomicrospira denitrificans 
     478        244  Coxiella burnetii (strain RSA 331 / Henzerling II)
     479        244  Streptococcus pyogenes serotype M12 (strain MGAS2096)
     480        244  Geobacter uraniireducens (strain Rf4) (Geobacter uraniumreducens)
     481        243  Mycobacterium sp. (strain KMS)
     482        243  Clostridium difficile (strain 630)
     483        242  Francisella tularensis subsp. tularensis (strain FSC 198)
     484        241  Mycobacterium sp. (strain JLS)
     485        241  Desulfovibrio vulgaris subsp. vulgaris (strain DP4)
     486        240  Lactobacillus casei (strain ATCC 334)
     487        240  Prochlorococcus marinus (strain MIT 9303)
     488        239  Francisella tularensis subsp. novicida (strain U112)
     489        238  Treponema denticola
     490        237  Acaryochloris marina (strain MBIC 11017)
     491        237  Bacillus stearothermophilus (Geobacillus stearothermophilus)
     492        237  Francisella tularensis subsp. holarctica (strain OSU18)
     493        236  Baumannia cicadellinicola subsp. Homalodisca coagulata
     494        235  Clostridium botulinum (strain Hall / ATCC 3502 / NCTC 13319 / Type A)
     495        235  Natronomonas pharaonis (strain DSM 2160 / ATCC 35678)
     496        235  Syntrophus aciditrophicus (strain SB)
     497        234  Sphingopyxis alaskensis (Sphingomonas alaskensis)
     498        234  Methanococcus vannielii (strain SB / ATCC 35089 / DSM 1224)
     499        234  Leptospira borgpetersenii serovar Hardjo-bovis (strain JB197)
     500        233  Hyphomonas neptunium (strain ATCC 15444)
     501        232  Pediococcus pentosaceus (strain ATCC 25745 / 183-1w)
     502        232  Methanococcus maripaludis (strain C7 / ATCC BAA-1331)
     503        232  Chlorobium phaeobacteroides (strain DSM 266)
     504        231  Chlamydomonas reinhardtii
     505        231  Verminephrobacter eiseniae (strain EF01-2)
     506        230  Pelobacter propionicus (strain DSM 2379)
     507        230  Alkaliphilus oremlandii (strain OhILAs) (Clostridium oremlandii (strain OhILAs))
     508        229  Helicobacter acinonychis (strain Sheeba)
     509        229  Methanococcus maripaludis (strain C5 / ATCC BAA-1333)
     510        229  Maricaulis maris (strain MCS10)
     511        229  Deinococcus geothermalis (strain DSM 11300)
     512        226  Chlamydia trachomatis (strain A/HAR-13 / ATCC VR-571B)
     513        226  Francisella tularensis subsp. tularensis (strain WY96-3418)
     514        225  Protochlamydia amoebophila (strain UWE25)
     515        224  Cricetulus griseus (Chinese hamster)
     516        223  Desulfotomaculum reducens (strain MI-1)
     517        223  Francisella tularensis subsp. holarctica (strain FTA)
     518        223  Syntrophomonas wolfei subsp. wolfei (strain Goettingen)
     519        222  Dinoroseobacter shibae (strain DFL 12)
     520        221  Frankia sp. (strain CcI3)
     521        221  Caulobacter sp. (strain K31)
     522        220  Syntrophobacter fumaroxidans (strain DSM 10017 / MPOB)
     523        220  Lactobacillus brevis (strain ATCC 367 / JCM 1170)
     524        219  Synechococcus sp. (strain RCC307)
     525        219  Bartonella tribocorum (strain CIP 105476 / IBS 506)
     526        218  Lactobacillus delbrueckii subsp. bulgaricus (strain ATCC 11842 / DSM 20081)
     527        218  Chlamydophila abortus
     528        217  Felis silvestris catus (Cat)
     529        217  Porphyra purpurea
     530        217  Leptospira borgpetersenii serovar Hardjo-bovis (strain L550)
     531        217  Bartonella bacilliformis (strain ATCC 35685 / KC583)
     532        217  Methanococcoides burtonii (strain DSM 6242)
     533        216  Dehalococcoides sp. (strain CBDB1)
     534        215  Dehalococcoides ethenogenes (strain 195)
     535        215  Rickettsia akari (strain Hartford)
     536        214  Klebsiella pneumoniae
     537        212  Granulibacter bethesdensis (strain ATCC BAA-1260 / CGDNIH1)
     538        212  Parvibaculum lavamentivorans (strain DS-1 / DSM 13023 / NCIMB 13966)
     539        211  Rickettsia canadensis (strain McKiel)
     540        210  Mycobacterium gilvum (strain PYR-GCK) (Mycobacterium flavescens 
     541        210  Francisella philomiragia subsp. philomiragia (strain ATCC 25017)
     542        210  Anaeromyxobacter sp. (strain Fw109-5)
     543        210  Rickettsia rickettsii (strain Sheila Smith)
     544        210  Bacteroides vulgatus (strain ATCC 8482 / DSM 1447 / NCTC 11154)
     545        209  Gibberella zeae (Fusarium graminearum)
     546        209  Streptococcus suis (strain 98HAH33)
     547        208  Nitratiruptor sp. (strain SB155-2)
     548        208  Porphyra yezoensis
     549        208  Caldicellulosiruptor saccharolyticus (strain ATCC 43494 / DSM 8903)
     550        207  Pelagibacter ubique
     551        206  Magnetococcus sp. (strain MC-1)
     552        206  Mesocricetus auratus (Golden hamster)
     553        206  Salinibacter ruber (strain DSM 13855)
     554        206  Prosthecochloris vibrioformis  (Chlorobium vibrioforme subsp. thiosulfatophilum  (Chlorobium phaeovibrioides 
     555        204  Chlamydophila felis (strain Fe/C-56)
     556        204  Lactobacillus delbrueckii subsp. bulgaricus (strain ATCC BAA-365)
     557        204  Psychrobacter sp. (strain PRwf-1)
     558        203  Encephalitozoon cuniculi
     559        203  Tropheryma whipplei (strain TW08/27) (Whipple's bacillus)
     560        202  Tropheryma whipplei (strain Twist) (Whipple's bacillus)
     561        202  Parabacteroides distasonis (strain ATCC 8503 / DSM 20701 / NCTC 11152)
     562        202  Lactobacillus reuteri (strain ATCC 23272 / DSM 20016 / F275)
     563        201  Acidiphilium cryptum (strain JF-5)
     564        201  Sphingomonas wittichii (strain RW1 / DSM 6014 / JCM 10273)
     565        201  Vaccinia virus (strain Western Reserve / WR) (VACV)
     566        201  Acidobacteria bacterium (strain Ellin345)
     567        201  Rubrobacter xylanophilus (strain DSM 9941 / NBRC 16129)
     568        200  Picrophilus torridus
     569        200  Saccharopolyspora erythraea (strain NRRL 23338)


   
   3.3  Taxonomic distribution of the sequences

   Kingdom        sequences (% of the database)
    Archaea           14694 (  4%)
    Bacteria         224003 ( 57%)
    Eukaryota        141583 ( 36%)
    Viruses           12387 (  3%)


   Within Eukaryota:

    Category            sequences (% of Eukaryota) (% of the complete database)
     Human                  20070 ( 14%)           (  5%)
     Other Mammalia         42975 ( 30%)           ( 11%)
     Other Vertebrata       13982 ( 10%)           (  4%)
     Viridiplantae          23475 ( 17%)           (  6%)
     Fungi                  21941 ( 15%)           (  6%)
     Insecta                 5528 (  4%)           (  1%)
     Nematoda                3765 (  3%)           (  1%)
     Other                   9847 (  7%)           (  3%)


4.  SEQUENCE SIZE

   Repartition of the sequences by size (excluding fragments)

               From   To  Number             From   To   Number
                  1-  50    6645             1001-1100     2928
                 51- 100   29356             1101-1200     1993
                101- 150   41958             1201-1300     1582
                151- 200   41301             1301-1400     1428
                201- 250   41103             1401-1500     1115
                251- 300   36011             1501-1600      556
                301- 350   35089             1601-1700      435
                351- 400   30808             1701-1800      380
                401- 450   25341             1801-1900      352
                451- 500   21000             1901-2000      282
                501- 550   14954             2001-2100      178
                551- 600   10904             2101-2200      246
                601- 650    9329             2201-2300      244
                651- 700    6517             2301-2400      164
                701- 750    5321             2401-2500      113
                751- 800    3918             >2500          860
                801- 850    3418
                851- 900    3641
                901- 950    2986
                951-1000    2114


   The average sequence length in UniProtKB/Swiss-Prot is 359 amino acids.

   The shortest sequence is   GWA_SEPOF (P83570):     2 amino acids.
   The longest sequence is  TITIN_MOUSE (A2ASS6): 35213 amino acids.


5.  JOURNAL CITATIONS

   Note: the following citation statistics reflect the number of distinct
         journal citations.

   Total number of journals cited in this release of UniProtKB/Swiss-Prot: 1930


   5.1 Table of the frequency of journal citations

        Journals cited 1x:  630
                       2x:  266
                       3x:  133
                       4x:  100
                       5x:   73
                       6x:   54
                       7x:   44
                       8x:   38
                       9x:   34
                      10x:   24
                  11- 20x:  150
                  21- 50x:  150
                  51-100x:   91
                    >100x:  143


   5.2  List of the most cited journals in UniProtKB/Swiss-Prot

   Nb    Citations   Journal name
   --    ---------   -------------------------------------------------------------
    1        16362   Journal of Biological Chemistry
    2         7669   Proceedings of the National Academy of Sciences of the U.S.A.
    3         4700   Journal of Bacteriology
    4         4405   Gene
    5         4201   Biochemical and Biophysical Research Communications
    6         4182   Nucleic Acids Research
    7         3749   FEBS Letters
    8         3504   Biochemistry
    9         3483   The EMBO Journal
   10         3128   Molecular and Cellular Biology
   11         3010   European Journal of Biochemistry
   12         2973   Nature
   13         2831   Biochimica et Biophysica Acta
   14         2713   Journal of Molecular Biology
   15         2434   Genomics
   16         2419   Cell
   17         2020   Biochemical Journal
   18         1893   Science
   19         1629   Journal of Virology
   20         1587   Molecular Microbiology
   21         1431   Journal of Cell Biology
   22         1427   Plant Molecular Biology
   23         1290   Molecular and General Genetics
   24         1232   Virology
   25         1208   Nature Genetics
   26         1201   Genes and Development
   27         1196   Human Molecular Genetics
   28         1122   Journal of Biochemistry
   29         1109   Plant Physiology
   30         1108   Oncogene
   31         1104   The American Journal of Human Genetics
   32          985   Development
   33          922   Journal of Immunology
   34          907   Human Mutation
   35          869   Genetics
   36          850   Molecular Biology of the Cell
   37          816   Infection and Immunity
   38          803   Structure
   39          772   Journal of General Virology
   40          757   Archives of Biochemistry and Biophysics
   41          723   Yeast
   42          718   The Plant Cell
   43          701   Blood
   44          672   Microbiology
   45          651   Molecular Cell
   46          617   Developmental Biology
   47          611   Journal of Cell Science
   48          600   FEMS Microbiology Letters
   49          598   Cancer Research
   50          597   The Plant Journal
   51          564   Human Genetics
   52          564   Nature Structural Biology
   53          533   Mechanisms of Development
   54          525   Current Biology
   55          511   Current Genetics
   56          477   Applied and Environmental Microbiology
   57          476   Journal of Neuroscience
   58          467   Acta Crystallographica, Section D
   59          466   Journal of Clinical Investigation
   60          463   Protein Science
   61          462   Neuron
   62          460   Mammalian Genome
   63          423   Immunogenetics
   64          421   The Journal of Experimental Medicine
   65          420   Toxicon
   66          415   Molecular Endocrinology
   67          410   Molecular and Biochemical Parasitology
   68          408   American Journal of Physiology
   69          379   Journal of Neurochemistry
   70          365   Endocrinology
   71          360   Journal of Molecular Evolution
   72          354   DNA and Cell Biology
   73          351   The Journal of Clinical Endocrinology and Metabolism
   74          346   DNA Sequence
   75          332   Molecular Biology and Evolution
   76          315   Bioscience, Biotechnology, and Biochemistry
   77          307   Journal of Medical Genetics
   78          306   Brain Research. Molecular Brain Research
   79          286   Biological Chemistry Hoppe-Seyler
   80          280   Proteins
   81          272   Cytogenetics and Cell Genetics
   82          261   Comparative Biochemistry and Physiology
   83          260   Peptides
   84          256   Journal of Investigative Dermatology
   85          251   Antimicrobial Agents and Chemotherapy
   86          245   Journal of General Microbiology
   87          245   Molecular Pharmacology
   88          240   Biology of Reproduction
   89          239   Plant and Cell Physiology
   90          239   Nature Cell Biology
   91          233   Experimental Cell Research
   92          225   Genome Research
   93          215   Hoppe-Seyler's Zeitschrift fur Physiologische Chemie
   94          213   Virus Research
   95          210   Neurology
   96          197   Developmental Dynamics
   97          194   Molecular Plant-Microbe Interactions
   98          193   RNA
   99          191   DNA Research
  100          188   European Journal of Immunology
  101          185   Biochimie
  102          181   Tissue Antigens
  103          175   Annals of Neurology
  104          174   European Journal of Human Genetics
  105          168   Planta
  106          167   Journal of Human Genetics
  107          166   Genes to Cells
  108          163   Molecular and Cellular Endocrinology
  109          163   Immunity
  110          163   Developmental Cell
  111          159   DNA
  112          155   Molecular Phylogenetics and Evolution
  113          154   American Journal of Medical Genetics
  114          152   Hemoglobin
  115          150   Archives of Microbiology
  116          150   Eukaryotic cell
  117          148   The New England Journal of Medicine
  118          147   Insect Biochemistry and Molecular Biology
  119          146   Bioorganicheskaia Khimiia
  120          139   Investigative Ophthalmology and Visual Science
  121          137   Molecular Reproduction and Development
  122          136   Diabetes
  123          134   Glycobiology
  124          134   Animal Genetics
  125          132   Molecular Immunology
  126          129   General and Comparative Endocrinology
  127          128   Molecular and Cellular Neuroscience
  128          125   International Journal of Cancer
  129          121   Archives of Virology
  130          119   Agricultural and Biological Chemistry
  131          116   The FASEB Journal
  132          112   British Journal of Haematology
  133          112   EMBO Reports
  134          111   Molecular Genetics and Metabolism
  135          111   Clinical Genetics
  136          110   Journal of Protein Chemistry
  137          108   Biological Chemistry
  138          106   Molecular Genetics and Genomics
  139          106   Journal of Cellular Biochemistry
  140          105   Journal of Neuroscience Research
  141          104   Neuroscience Letters
  142          103   Journal of Molecular Endocrinology
  143          103   Journal of Lipid Research
  144          100   Biochemistry and Molecular Biology International


6.  STATISTICS FOR SOME LINE TYPES

The following table summarizes the total number of some UniProtKB/Swiss-Prot lines,
as well as the number of entries with at least one such line, and the
frequency of the lines.

                                   Total    Number of  Average
Line type / subtype                number   entries    per entry
---------------------------------  -------- ---------  ---------

   References (RL)                     716052              1.82
1     Journal                          584653    309924    1.49
2     Submitted to EMBL/GenBank/DDBJ   124370    114305    0.32
3     Submitted to other databases       5069      4680    0.01
4     Book citation                       594       584   <0.01
5     Plant Gene Register                 543       531   <0.01
6     Thesis                              389       387   <0.01
7     Unpublished observations            287       283   <0.01
8     Patent                              141       139   <0.01
9     Worm Breeder's Gazette                6         6   <0.01

Total number of distinct authors cited in UniProtKB/Swiss-Prot: 263407.

   Comments (CC)                      1625064              4.14
1     SIMILARITY                       455111    369189    1.16
2     FUNCTION                         281813    271302    0.72
3     SUBCELLULAR LOCATION             225139    220959    0.57
4     CATALYTIC ACTIVITY               157277    143739    0.40
5     SUBUNIT                          154763    154763    0.39
6     PATHWAY                           91738     79969    0.23
7     COFACTOR                          65345     59933    0.17
8     TISSUE SPECIFICITY                29543     29543    0.08
9     PTM                               29031     23754    0.07
10    MISCELLANEOUS                     26924     24573    0.07
11    DOMAIN                            24285     21420    0.06
12    ALTERNATIVE PRODUCTS              16919     16919    0.04
13    SEQUENCE CAUTION                  10382     10382    0.03
14    INTERACTION                        9471      9471    0.02
15    INDUCTION                          9204      9204    0.02
16    DEVELOPMENTAL STAGE                7584      7584    0.02
17    WEB RESOURCE                       6317      5139    0.02
18    ENZYME REGULATION                  6276      6276    0.02
19    CAUTION                            5356      5249    0.01
20    DISEASE                            4375      3018    0.01
21    MASS SPECTROMETRY                  3571      2713    0.01
22    BIOPHYSICOCHEMICAL PROPERTIES      2236      2236    0.01
23    POLYMORPHISM                        718       688   <0.01
24    RNA EDITING                         544       544   <0.01
25    ALLERGEN                            447       447   <0.01
26    TOXIC DOSE                          379       371   <0.01
27    BIOTECHNOLOGY                       236       234   <0.01
28    PHARMACEUTICAL                       80        80   <0.01

   Features (FT)                      2470799              6.29
1     CHAIN                            398905    388707    1.02
2     TRANSMEM                         269851     55094    0.69
3     METAL                            179485     44921    0.46
4     BINDING                          127583     40359    0.32
5     DOMAIN                           118825     68530    0.30
6     CONFLICT                         108345     37614    0.28
7     STRAND                           106813     10124    0.27
8     MOD_RES                          104497     37238    0.27
9     TOPO_DOM                         104334     21254    0.27
10    HELIX                            103676     10650    0.26
11    ACT_SITE                          94617     56024    0.24
12    CARBOHYD                          86991     22400    0.22
13    DISULFID                          85328     21608    0.22
14    REPEAT                            72946     11107    0.19
15    NP_BIND                           70900     48718    0.18
16    VARIANT                           60913     12807    0.16
17    REGION                            60758     33829    0.15
18    COMPBIAS                          37679     21541    0.10
19    VAR_SEQ                           35538     15081    0.09
20    SIGNAL                            29376     29366    0.07
21    MOTIF                             25878     16782    0.07
22    TURN                              25694      8574    0.07
23    SITE                              25012     14433    0.06
24    ZN_FING                           24643      9992    0.06
25    MUTAGEN                           24327      5884    0.06
26    COILED                            14972      9908    0.04
27    INIT_MET                          12351     12351    0.03
28    NON_TER                           10933      8359    0.03
29    LIPID                              9425      6043    0.02
30    PROPEP                             9382      7808    0.02
31    DNA_BIND                           8799      8132    0.02
32    PEPTIDE                            7590      4655    0.02
33    TRANSIT                            5616      5533    0.01
34    CA_BIND                            3347      1388    0.01
35    CROSSLNK                           3031      2096    0.01
36    NON_CONS                           1432       581   <0.01
37    UNSURE                              667       223   <0.01
38    NON_STD                             340       266   <0.01

   Cross-references (DR)              6953477             17.71
1     InterPro                         954161    365778    2.43
2     EMBL                             678519    383792    1.73
3     GO                               659518    261671    1.68
4     Pfam                             505996    353567    1.29
5     PROSITE                          356139    223270    0.91
6     RefSeq                           355438    325205    0.91
7     GeneID                           341505    324984    0.87
8     KEGG                             300528    280532    0.77
9     GenomeReviews                    256332    238769    0.65
10    HAMAP                            205008    204908    0.52
11    HOGENOM                          198402    198399    0.51
12    TIGRFAMs                         185636    173805    0.47
13    Gene3D                           180250    149065    0.46
14    BioCyc                           145962    139440    0.37
15    PANTHER                          143138    132190    0.36
16    PRINTS                           123838    101263    0.32
17    NMPDR                            117067    117064    0.30
18    PIR                              110967    101279    0.28
19    ProDom                           109281    106447    0.28
20    SMART                            104656     79501    0.27
21    HSSP                              83910     83910    0.21
22    UniGene                           78433     72798    0.20
23    HOVERGEN                          75109     75109    0.19
24    Ensembl                           66712     65180    0.17
25    PIRSF                             58210     58210    0.15
26    ArrayExpress                      53103     53103    0.14
27    PDBsum                            52185     13136    0.13
28    PDB                               52185     13136    0.13
29    SMR                               49807     49807    0.13
30    GermOnline                        41973     41363    0.11
31    TIGR                              31613     30912    0.08
32    CleanEx                           30182     29548    0.08
33    HGNC                              18843     18702    0.05
34    LinkHub                           18105     18105    0.05
35    IntAct                            16471     16471    0.04
36    PhosphoSite                       15991     15991    0.04
37    PharmGKB                          15825     15815    0.04
38    MGI                               15680     15629    0.04
39    MIM                               15171     12072    0.04
40    H-InvDB                           11260      9566    0.03
41    DIP                                9000      8950    0.02
42    MEROPS                             7206      6910    0.02
43    RGD                                6999      6994    0.02
44    TAIR                               6998      6884    0.02
45    SGD                                6640      6538    0.02
46    CYGD                               6628      6523    0.02
47    HPA                                5789      4704    0.01
48    DrugBank                           5326      1627    0.01
49    PeptideAtlas                       5168      5168    0.01
50    GeneDB_Spombe                      4460      4419    0.01
51    EcoGene                            4331      4328    0.01
52    EchoBASE                           4159      4124    0.01
53    WormPep                            3884      3180    0.01
54    FlyBase                            3692      3564    0.01
55    Gramene                            3681      3681    0.01
56    WormBase                           3578      3494    0.01
57    Reactome                           3416      2069    0.01
58    SubtiList                          2819      2818    0.01
59    Orphanet                           2633      1673    0.01
60    dictyBase                          2568      2478    0.01
61    GeneFarm                           2252      2231    0.01
62    ZFIN                               2105      2089    0.01
63    StyGene                            1653      1649   <0.01
64    TubercuList                        1473      1437   <0.01
65    SWISS-2DPAGE                       1182      1182   <0.01
66    PseudoCAP                          1180      1171   <0.01
67    ListiList                          1131      1123   <0.01
68    REPRODUCTION-2DPAGE                1029       941   <0.01
69    AGD                                 769       763   <0.01
70    LegioList                           699       697   <0.01
71    PhotoList                           692       692   <0.01
72    Leproma                             650       647   <0.01
73    PeroxiBase                          503       492   <0.01
74    World-2DPAGE                        495       495   <0.01
75    CGD                                 471       471   <0.01
76    MaizeGDB                            468       463   <0.01
77    ProMEX                              423       423   <0.01
78    DisProt                             397       394   <0.01
79    OGP                                 378       378   <0.01
80    SagaList                            373       372   <0.01
81    REBASE                              351       343   <0.01
82    ECO2DBASE                           351       299   <0.01
83    GlycoSuiteDB                        282       282   <0.01
84    BuruList                            264       264   <0.01
85    PHCI-2DPAGE                         244       244   <0.01
86    VectorBase                          236       229   <0.01
87    BindingDB                           210       210   <0.01
88    MypuList                            198       198   <0.01
89    DOSAC-COBS-2DPAGE                   150       150   <0.01
90    Aarhus/Ghent-2DPAGE                 126        96   <0.01
91    Siena-2DPAGE                        102       102   <0.01
92    HSC-2DPAGE                           85        85   <0.01
93    2DBase-Ecoli                         84        84   <0.01
94    PhosSite                             73        73   <0.01
95    Cornea-2DPAGE                        67        67   <0.01
96    COMPLUYEAST-2DPAGE                   59        59   <0.01
97    euHCVdb                              55        44   <0.01
98    PMMA-2DPAGE                          52        52   <0.01
99    PptaseDB                             31        31   <0.01
100   Rat-heart-2DPAGE                     28        28   <0.01
101   ANU-2DPAGE                           22        22   <0.01

Number of explicitly cross-referenced databases: 102
Number of implicitly cross-referenced databases:  23


7.  MISCELLANEOUS STATISTICS

Total number of distinct authors cited in UniProtKB/Swiss-Prot: 254724

Total number of entries encoded on a Mitochondrion: 4375
Total number of entries encoded on a Plasmid: 3430
Total number of entries encoded on a Plastid: 9853
Total number of entries encoded on a Plastid; Apicoplast: 16
Total number of entries encoded on a Plastid; Chloroplast: 9444
Total number of entries encoded on a Plastid; Cyanelle: 145
Total number of entries encoded on a Plastid; Non-photosynthetic plastid: 118

Number of fragments: 8097
Number of additional sequences produced by alternative splicing, initiation or promoter usage: 26284



UniProtKB/TrEMBL protein database release 39.0 statistics


1.  INTRODUCTION

Release 39.0 of 22-Jul-2008 of UniProtKB/TrEMBL contains 6'070'085 sequence entries
comprising 624'149'168 amino acids.

815'041 sequences have been added since release 38, the sequence data of
6'451 existing entries has been updated and the annotations of
5'255'044 entries have been revised. This represents an increase of 15%.



2.  AMINO ACID COMPOSITION

   2.1  Composition in percent for the complete database
   
   Ala (A) 8.57   Gln (Q) 3.89   Leu (L) 9.85   Ser (S) 6.77
   Arg (R) 5.53   Glu (E) 6.06   Lys (K) 5.23   Thr (T) 5.60
   Asn (N) 4.19   Gly (G) 7.07   Met (M) 2.42   Trp (W) 1.34
   Asp (D) 5.26   His (H) 2.20   Phe (F) 4.04   Tyr (Y) 3.03
   Cys (C) 1.33   Ile (I) 5.96   Pro (P) 4.81   Val (V) 6.66

   Asx (B) 0.000  Glx (Z) 0.000  Xaa (X) 0.07


   2.2  Classification of the amino acids by their frequency

   Leu, Ala, Gly, Ser, Val, Glu, Ile, Thr, Arg, Asp, Lys, Pro, Asn, Phe,
   Gln, Tyr, Met, His, Trp, Cys


3.  TAXONOMIC ORIGIN

   Total number of species represented in this release of UniProtKB/TrEMBL: 170489

   The first twenty species represent 954002 sequences:  15.7 % of the
   total number of entries.


   3.1 Table of the frequency of occurrence of species

        Species represented 1x:78172
                            2x:30985
                            3x:16319
                            4x: 9281
                            5x: 5325
                            6x: 3971
                            7x: 2943
                            8x: 2395
                            9x: 1875
                           10x: 2217
                       11- 20x: 9774
                       21- 50x: 3492
                       51-100x: 1416
                         >100x: 2324



   3.2  Table of the most represented species

  ------  ---------  --------------------------------------------
  Number  Frequency  Species
  ------  ---------  --------------------------------------------
       1     238675  Human immunodeficiency virus 1
       2      95231  Oryza sativa subsp. japonica (Rice)
       3      54861  Homo sapiens (Human)
       4      54323  Vitis vinifera (Grape)
       5      50188  Trichomonas vaginalis G3
       6      44675  Mus musculus (Mouse)
       7      44524  Arabidopsis thaliana (Mouse-ear cress)
       8      42163  Hepatitis C virus
       9      39808  Paramecium tetraurelia
      10      39254  Oryza sativa subsp. indica (Rice)
      11      35653  Physcomitrella patens subsp. patens
      12      28243  Drosophila melanogaster (Fruit fly)
      13      28067  Tetraodon nigroviridis (Green puffer)
      14      27250  uncultured bacterium
      15      24942  Danio rerio (Zebrafish) (Brachydanio rerio)
      16      24842  Nematostella vectensis (Starlet sea anemone)
      17      20534  Caenorhabditis elegans
      18      20490  Trypanosoma cruzi
      19      20180  Culex quinquefasciatus (Southern house mosquito)
      20      20099  Hepatitis B virus (HBV)
      21      19172  Caenorhabditis briggsae
      22      17883  Laccaria bicolor (strain S238N-H82) (Bicoloured deceiver) 
      23      16803  Aedes aegypti (Yellowfever mosquito)
      24      16685  Tetrahymena thermophila SB210
      25      16302  Botryotinia fuckeliana (strain B05.10) (Noble rot fungus) (Botrytis cinerea)
      26      15880  Phaeosphaeria nodorum (Septoria nodorum)
      27      14718  Chlamydomonas reinhardtii
      28      14679  Plasmodium chabaudi
      29      14325  Sclerotinia sclerotiorum (strain ATCC 18683 / 1980 / Ss-1) (White mold) 
      30      14158  Anopheles gambiae (African malaria mosquito)
      31      14036  Aspergillus niger
      32      13492  Coprinopsis cinerea (strain Okayama-7 / 130 / FGSC 9003) (Inky cap fungus) 
      33      12757  Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea)
      34      12419  Xenopus laevis (African clawed frog)
      35      12062  Pyrenophora tritici-repentis Pt-1C-BFP
      36      11941  Aspergillus oryzae
      37      11788  Plasmodium berghei
      38      11698  Dictyostelium discoideum (Slime mold)
      39      11570  Brugia malayi (Filarial nematode worm)
      40      10914  Chaetomium globosum (Soil fungus)
      41      10714  Podospora anserina
      42      10426  Neurospora crassa
      43      10323  Coccidioides immitis
      44      10318  Hepatitis C virus subtype 1b
      45      10267  Aspergillus terreus (strain NIH 2624)
      46      10262  Neosartorya fischeri  (Aspergillus fischerianus 
      47      10040  Escherichia coli
      48       9990  Drosophila pseudoobscura (Fruit fly)
      49       9905  Aspergillus fumigatus (strain CEA10 / CBS 144.89 / FGSC A1163) 
      50       9896  Bos taurus (Bovine)
      51       9834  Schistosoma japonicum (Blood fluke)
      52       9799  Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
      53       9673  Cryptococcus neoformans (Filobasidiella neoformans)
      54       9650  Aspergillus fumigatus (Sartorya fumigata)
      55       9469  Trypanosoma brucei
      56       9456  Emericella nidulans (Aspergillus nidulans)
      57       9287  Candida albicans (Yeast)
      58       9227  Monosiga brevicollis (Choanoflagellate)
      59       9203  Ajellomyces capsulata (strain NAm1 / WU24) (Darling's disease fungus) 
      60       9201  Sorangium cellulosum (strain So ce56) (Polyangium cellulosum (strain So ce56))
      61       8983  Aspergillus clavatus
      62       8826  Rhodococcus sp. (strain RHA1)
      63       8781  Rattus norvegicus (Rat)
      64       8607  Entamoeba dispar SAW760
      65       8603  Methylobacterium nodulans ORS 2060
      66       8513  Stigmatella aurantiaca DW4/3-1
      67       8475  Simian immunodeficiency virus (isolate CPZ GAB1) (SIV-cpz) 
      68       8437  Plesiocystis pacifica SIR-1
      69       8398  Helicobacter pylori (Campylobacter pylori)
      70       8249  Microscilla marina ATCC 23134
      71       8205  Burkholderia xenovorans (strain LB400)
      72       8129  Bradyrhizobium japonicum
      73       8027  Leishmania infantum
      74       7970  Ostreococcus tauri
      75       7935  Acaryochloris marina (strain MBIC 11017)
      76       7887  Leishmania braziliensis
      77       7810  Plasmodium yoelii yoelii
      78       7642  Pseudomonas aeruginosa
      79       7575  Solibacter usitatus (strain Ellin6076)
      80       7514  Plasmodium vivax
      81       7503  Streptomyces coelicolor
      82       7501  Rhizobium leguminosarum bv. trifolii WSM1325
      83       7463  Burkholderia phymatum (strain DSM 17167 / STM815)
      84       7463  Plasmodium falciparum
      85       7401  Ostreococcus lucimarinus (strain CCE9901)
      86       7349  Burkholderia pseudomallei 305
      87       7293  Bradyrhizobium sp. (strain BTAi1 / ATCC BAA-1182)
      88       7292  Burkholderia sp. (strain 383) (Burkholderia cepacia 
      89       7274  Clostridium bolteae ATCC BAA-613
      90       7267  Streptomyces avermitilis
      91       7221  Burkholderia multivorans (strain ATCC 17616 / 249)
      92       7197  Burkholderia phytofirmans (strain DSM 17436 / PsJN)
      93       7136  Rhizobium loti (Mesorhizobium loti)
      94       7132  Frankia sp. (strain EAN1pec)
      95       7124  Burkholderia ambifaria MEX-5
      96       7122  Leishmania major
      97       7081  Burkholderia vietnamiensis (strain G4 / LMG 22486) (Burkholderia cepacia 
      98       7061  Myxococcus xanthus (strain DK 1622)
      99       7005  Streptomyces griseus subsp. griseus (strain JCM 4626 / NBRC 13350)
     100       6981  Burkholderia cenocepacia (strain MC0-3)


   3.3  Taxonomic distribution of the sequences


   Kingdom        sequences (% of the database)
    Archaea          117313 (  2%)
    Bacteria        3404071 ( 56%)
    Eukaryota       1895580 ( 31%)
    Viruses          648091 ( 11%)
    Other              5029 ( <1%)



   Within Eukaryota:


    Category            sequences (% of Eukaryota) (% of the complete database)
     Human                  54862 (  3%)           (  1%)
     Other Mammalia        135932 (  7%)           (  2%)
     Other Vertebrata      211928 ( 11%)           (  3%)
     Viridiplantae         484409 ( 26%)           (  8%)
     Fungi                 361583 ( 19%)           (  6%)
     Insecta               190201 ( 10%)           (  3%)
     Nematoda               56736 (  3%)           (  1%)
     Other                 399929 ( 21%)           (  7%)



4.  SEQUENCE SIZE

   Repartition of the sequences by size (excluding fragments)

               From   To  Number             From   To   Number
                  1-  50  208566             1001-1100    40261
                 51- 100  637056             1101-1200    27366
                101- 150  747961             1201-1300    18801
                151- 200  699159             1301-1400    12801
                201- 250  674548             1401-1500    10254
                251- 300  573450             1501-1600     7455
                301- 350  535981             1601-1700     5842
                351- 400  410656             1701-1800     4650
                401- 450  347643             1801-1900     3538
                451- 500  284281             1901-2000     3059
                501- 550  190417             2001-2100     2455
                551- 600  141809             2101-2200     2451
                601- 650  103668             2201-2300     1886
                651- 700   83899             2301-2400     1594
                701- 750   71389             2401-2500     1362
                751- 800   62475             >2500        11470
                801- 850   46437
                851- 900   42027
                901- 950   29461
                951-1000   23945




   The average sequence length in UniProtKB/TrEMBL is   322 amino acids.

   The shortest sequence is Q16047_HUMAN:     4 amino acids.
   The longest sequence is  Q3ASY8_CHLCH: 36805 amino acids.



5.  STATISTICS FOR SOME LINE TYPES

The following table summarizes the total number of some UniProtKB/TrEMBL lines,
as well as the number of entries with at least one such line, and the
frequency of the lines.

                                   Total    Number of  Average
Line type / subtype                number   entries    per entry
---------------------------------  -------- ---------  ---------

References (RL)                    7640409              1.26
   Submitted to EMBL/GenBank/DDBJ  4154948   3514648    0.68
   Journal                         3352515   3093695    0.55
   Thesis                             6880      6824   <0.01
   Book citation                      4356      4312   <0.01
   Submitted to other databases       3480      3473   <0.01
   Other                            118230    116766    0.02

Comments (CC)                      4393499              0.72
   SIMILARITY                      1358723   1235686    0.22
   CAUTION                         1317857   1317857    0.22
   CATALYTIC ACTIVITY               449239    383227    0.07
   FUNCTION                         442937    425981    0.07
   SUBCELLULAR LOCATION             362423    362395    0.06
   PATHWAY                          163108    149477    0.03
   SUBUNIT                          149694    148693    0.02
   COFACTOR                         138898    136672    0.02
   MISCELLANEOUS                      5726      5726   <0.01
   INTERACTION                        4295      4295   <0.01
   DOMAIN                              599       599   <0.01

Features (FT)                      2546720              0.42
   NON_TER                         2098331   1246419    0.35
   CHAIN                            276320    220896    0.05
   SIGNAL                           171508    171508    0.03
   TRANSIT                             561       561   <0.01

Cross-references (DR)             56036818              9.23
   GO                             11116789   3570918    1.83
   InterPro                        9220668   4167387    1.52
   EMBL                            6851513   6062637    1.13
   Pfam                            5198497   3844857    0.86
   RefSeq                          2971744   2875731    0.49
   GeneID                          2957505   2869408    0.49
   PROSITE                         2842142   1869467    0.47
   KEGG                            1860398   1795530    0.31
   Gene3D                          1757519   1506810    0.29
   GenomeReviews                   1596334   1546434    0.26
   PRINTS                          1089660    917737    0.18
   HOGENOM                         1061239   1061235    0.17
   SMART                           1008925    792054    0.17
   NMPDR                            957022    957011    0.16
   TIGRFAMs                         939056    858853    0.15
   PANTHER                          905430    859383    0.15
   ProDom                           700180    668524    0.12
   SMR                              494328    494243    0.08
   HOVERGEN                         316989    316798    0.05
   BioCyc                           304168    291467    0.05
   UniGene                          275255    251204    0.05
   PIRSF                            262205    262205    0.04
   HSSP                             261663    261371    0.04
   TIGR                             198869    191592    0.03
   PIR                              182023    149002    0.03
   Ensembl                          157982    151190    0.03
   ArrayExpress                     100469    100437    0.02
   Gramene                           69959     69959    0.01
   euHCVdb                           47728     47728    0.01
   MGI                               40387     40202    0.01
   FlyBase                           34972     34832    0.01
   HGNC                              29172     29143   <0.01
   VectorBase                        29057     28725   <0.01
   MEROPS                            26390     25729   <0.01
   TAIR                              19447     19396   <0.01
   WormPep                           19423     19320   <0.01
   WormBase                          19414     19320   <0.01
   ZFIN                              16063     16056   <0.01
   LinkHub                           12019     12019   <0.01
   dictyBase                         10181     10179   <0.01
   CGD                                6987      6987   <0.01
   RGD                                5924      4066   <0.01
   PDBsum                             5860      3334   <0.01
   PDB                                5860      3334   <0.01
   IntAct                             5461      5460   <0.01
   LegioList                          5399      5369   <0.01
   ListiList                          4684      4667   <0.01
   PseudoCAP                          4390      4387   <0.01
   PhotoList                          3988      3864   <0.01
   BuruList                           3976      3942   <0.01
   AGD                                3925      3925   <0.01
   REBASE                             3685      3660   <0.01
   TubercuList                        2517      2511   <0.01
   DIP                                2276      2271   <0.01
   PeroxiBase                         2082      2076   <0.01
   SagaList                           1721      1627   <0.01
   PhosphoSite                        1404      1404   <0.01
   Leproma                             957       956   <0.01
   MypuList                            584       580   <0.01
   GeneDB_Spombe                       515       510   <0.01
   ProMEX                              483       483   <0.01
   World-2DPAGE                        418       418   <0.01
   SGD                                 327       327   <0.01
   PeptideAtlas                        194       194   <0.01
   PharmGKB                            121       121   <0.01
   PHCI-2DPAGE                         103       103   <0.01
   Reactome                             67        62   <0.01
   ANU-2DPAGE                           59        59   <0.01
   SWISS-2DPAGE                         29        29   <0.01
   REPRODUCTION-2DPAGE                  16        16   <0.01
   CYGD                                 16        16   <0.01
   PMMA-2DPAGE                           3         3   <0.01
   Siena-2DPAGE                          2         2   <0.01
   COMPLUYEAST-2DPAGE                    1         1   <0.01

Number of explicitly cross-referenced databases: 102


6.  MISCELLANEOUS STATISTICS

Total number of distinct authors cited in UniProtKB/TrEMBL: 266582

Total number of entries encoded on a Mitochondrion: 213912
Total number of entries encoded on a Plasmid: 97022
Total number of entries encoded on a Plastid: 4959
Total number of entries encoded on a Plastid; Apicoplast: 264
Total number of entries encoded on a Plastid; Chloroplast: 74165
Total number of entries encoded on a Plastid; Cyanelle: 7
Total number of entries encoded on a Plastid; Non-photosynthetic plastid: 237

Number of fragments: 1245645


Submissions and Updates

We welcome feedback from our users. We would especially appreciate your notifying us if you find that sequences belonging to your field of expertise are missing from the database. We also would like to be notified about annotations to be updated, if, for example, the function of a protein has been clarified or if new information about post-translational modifications has become available.

Submit new sequence data, updates and corrections at http://www.uniprot.org/support/submissions.shtml

For all queries regarding submissions to UniProtKB and to submit new protein sequence data, please contact:

UniProt Knowledgebase
The EMBL Outstation - The European Bioinformatics Institute
Wellcome Trust Genome Campus
Hinxton
Cambridge CB10 1SD
United Kingdom

Telephone: (+44 1223) 494 462
Telefax: (+44 1223) 494 468
E-mail:


Download information

Minor releases (every 3 weeks)

The latest data of the UniProt Knowledgebase is available in various format (flatfile, XML or FASTA) at http://www.uniprot.org/database/download.shtml. The data is further supplemented by a file containing the sequences of all additional alternative isoforms annotated in UniProtKB/Swiss-Prot. This data set is documented in the file ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/complete/README.varsplic

Major releases

For users who wish to download the UniProt Knowledgebase only occasionally, we distribute the latest major release (updated 3 times per year) in flatfile format. Previous UniProtKB/Swiss-Prot and UniProtKB/TrEMBL are archived under ftp://ftp.uniprot.org/pub/databases/uniprot/previous_major_releases. The UniProt Knowledgebase major release is also available on DVD from the EBI.


Contact

EMBL Outstation
European Bioinformatics Institute (EBI)
Wellcome Trust Genome Campus
Hinxton
Cambridge CB10 1SD
United Kingdom

Telephone: (+44 1223) 494 444
Fax: (+44 1223) 494 468
Electronic mail address: /
WWW server: http://www.ebi.ac.uk/


Swiss Institute of Bioinformatics (SIB)
Centre Medical Universitaire
1, rue Michel Servet
1211 Geneva 4
Switzerland

Telephone: (+41 22) 379 50 50
Fax: (+41 22) 379 58 58
Electronic mail address:
WWW server: http://www.expasy.org/


Protein Information Resource (PIR)
Georgetown University Medical Center
3300 Whitehaven St., Suite 1200
Washington, DC 20008
United States of America

Telephone: (+1 202) 687 1039
Fax: (+1 202) 687 0057)
Electronic mail address:
WWW server: http://pir.georgetown.edu

Citation

If you want to cite UniProt in a publication, please use the following reference:

The UniProt Consortium
"The Universal Protein Resource (UniProt)"
Nucleic Acids Res. 36:D190-D195(2008) doi:10.1093/nar/gkm895

ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
 Hosted by ch flag SIB Switzerland Mirror sites: Australia  Brazil  Canada  China  Korea
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!