Release notes: UniProtKB release 14.0 of 22-Jul-2008
Related documents: UniProtKB user manual, Recent changes, Forthcoming changes.
| Introduction |
|---|
Release 14.0 of the UniProt Knowledgebase is composed of the UniProtKB/Swiss-Prot Protein Knowledgebase release 56.0 and the UniProtKB/TrEMBL Protein Database release 39.0.
More information on these databases can be found in the user manual section "What is the UniProt Knowledgebase?".
| UniProtKB/Swiss-Prot protein knowledgebase release 56.0 statistics |
|---|
The growth of the database is summarized below.
| Release | Date | Number of entries | Number of amino acids |
|---|---|---|---|
| 2.0 | 09/86 | 3'939 | 900'163 |
| 3.0 | 11/86 | 4'160 | 969'641 |
| 4.0 | 04/87 | 4'387 | 1'036'010 |
| 5.0 | 09/87 | 5'205 | 1'327'683 |
| 6.0 | 01/88 | 6'102 | 1'653'982 |
| 7.0 | 04/88 | 6'821 | 1'885'771 |
| 8.0 | 08/88 | 7'724 | 2'224'465 |
| 9.0 | 11/88 | 8'702 | 2'498'140 |
| 10.0 | 03/89 | 10'008 | 2'952'613 |
| 11.0 | 07/89 | 10'856 | 3'265'966 |
| 12.0 | 10/89 | 12'305 | 3'797'482 |
| 13.0 | 01/90 | 13'837 | 4'347'336 |
| 14.0 | 04/90 | 15'409 | 4'914'264 |
| 15.0 | 08/90 | 16'941 | 5'486'399 |
| 16.0 | 11/90 | 18'364 | 5'986'949 |
| 17.0 | 02/91 | 20'024 | 6'524'504 |
| 18.0 | 05/91 | 20'772 | 6'792'034 |
| 19.0 | 08/91 | 21'795 | 7'173'785 |
| 20.0 | 11/91 | 22'654 | 7'500'130 |
| 21.0 | 03/92 | 23'742 | 7'866'596 |
| 22.0 | 05/92 | 25'044 | 8'375'696 |
| 23.0 | 08/92 | 26'706 | 9'011'391 |
| 24.0 | 12/92 | 28'154 | 9'545'427 |
| 25.0 | 04/93 | 29'955 | 10'214'020 |
| 26.0 | 07/93 | 31'808 | 10'875'091 |
| 27.0 | 10/93 | 33'329 | 11'484'420 |
| 28.0 | 02/94 | 36'000 | 12'496'420 |
| 29.0 | 06/94 | 38'303 | 13'464'008 |
| 30.0 | 10/94 | 40'292 | 14'147'368 |
| 31.0 | 02/95 | 43'470 | 15'335'248 |
| 32.0 | 11/95 | 49'340 | 17'385'503 |
| 33.0 | 02/96 | 52'205 | 18'531'384 |
| 34.0 | 10/96 | 59'021 | 21'210'389 |
| 35.0 | 11/97 | 69'113 | 25'083'768 |
| 36.0 | 07/98 | 74'019 | 26'840'295 |
| 37.0 | 12/98 | 77'977 | 28'268'293 |
| 38.0 | 07/99 | 80'000 | 29'085'965 |
| 39.0 | 05/00 | 86'593 | 31'411'114 |
| 40.0 | 10/01 | 101'602 | 37'315'215 |
| 41.0 | 02/03 | 122'564 | 44'986'459 |
| 42.0 | 10/03 | 135'850 | 50'046'799 |
| 43.0 | 03/04 | 146'720 | 54'093'154 |
| 44.0 | 07/04 | 153'871 | 56'608'159 |
| 45.0 | 10/04 | 163'235 | 59'631'787 |
| 46.0 | 02/05 | 168'297 | 61'443'278 |
| 47.0 | 05/05 | 181'577 | 65'746'672 |
| 48.0 | 09/05 | 194'317 | 70'391'852 |
| 49.0 | 02/06 | 207'132 | 75'438'310 |
| 50.0 | 05/06 | 222'289 | 81'585'146 |
| 51.0 | 10/06 | 241'242 | 88'541'632 |
| 52.0 | 03/07 | 261'513 | 95'638'062 |
| 53.0 | 05/07 | 269'293 | 98'902'758 |
| 54.0 | 07/07 | 276'256 | 101'466'206 |
| 55.0 | 02/08 | 356'194 | 127'836'513 |
| 56.0 | 07/08 | 392'667 | 141'217'034 |
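
As a quick reading aid, the last two rows of the growth table can be turned into a net entry increase and a mean sequence length. This is a minimal sketch, not part of the official statistics: the dictionary layout and the helper names `net_growth` and `mean_length` are illustrative assumptions, and only the four figures are taken from the table.

```python
# Figures copied from the last two rows of the growth table above.
releases = {
    "55.0": {"entries": 356_194, "amino_acids": 127_836_513},
    "56.0": {"entries": 392_667, "amino_acids": 141_217_034},
}


def net_growth(old, new):
    """Net change in the number of entries between two releases."""
    return new["entries"] - old["entries"]


def mean_length(release):
    """Average number of amino acids per entry in one release."""
    return release["amino_acids"] / release["entries"]


growth = net_growth(releases["55.0"], releases["56.0"])
print(f"Net new entries, 55.0 -> 56.0: {growth}")
print(f"Mean sequence length in 56.0: {mean_length(releases['56.0']):.1f} aa")
```

The net figure counts only the difference between the two totals, so it can be smaller than the number of newly added sequences whenever entries also leave the database.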
In rare cases, UniProtKB/Swiss-Prot entries are removed. Deleted entries are almost exclusively Open Reading Frames (ORFs) that were wrongly predicted to code for proteins. When there is enough evidence that these hypothetical proteins are not real, we remove them from UniProtKB/Swiss-Prot. The document delac_sp.txt lists all accession numbers that were previously present in UniProtKB/Swiss-Prot but have since been deleted from the database.
We have selected a number of organisms that are the target of genome sequencing and/or mapping projects and for which we intend to:
From our efforts to annotate human sequence entries as completely as possible arose the HPI project, and the bacterial model organisms became the focus of the HAMAP project. Here is the current status of the model organisms which are not covered by these two projects:
| Organism | Database cross-references | Index file | Number of sequences |
|---|---|---|---|
| A.thaliana | TAIR | arath.txt | 6'914 |
| C.albicans | None yet | calbican.txt | 727 |
| C.elegans | Wormpep | celegans.txt | 3'188 |
| D.discoideum | DictyBase | dicty.txt | 2'479 |
| D.melanogaster | FlyBase | fly.txt | 2'817 |
| M.musculus | MGD | mgdtosp.txt | 15'813 |
| S.cerevisiae | SGD | yeast.txt | 6'553 |
| S.pombe | GeneDB_SPombe | pombe.txt | 4'421 |
1. INTRODUCTION
Release 56.0 of 22-Jul-08 of UniProtKB/Swiss-Prot contains 392667 sequence entries,
comprising 141217034 amino acids abstracted from 172036 references.
36631 sequences have been added since release 55.0, the sequence data of
605 existing entries has been updated and the annotations of
356036 entries have been revised.
Number of fragments: 8097
Number of additional sequences produced by alternative splicing, initiation or promoter usage, or ribosomal frameshifting: 26036
Protein existence:
PE 1: Evidence at protein level 60013 entries
PE 2: Evidence at transcript level 63043 entries
PE 3: Inferred from homology 255230 entries
PE 4: Predicted 13153 entries
PE 5: Uncertain 1228 entries
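
The protein existence counts above can be checked against the release total and expressed as shares. The following is a small sketch under the assumption that the five PE classes partition all entries; the variable names are illustrative, and the counts are copied from the list above.

```python
# Entry counts per protein existence (PE) level, copied from the list above.
pe_counts = {
    "PE 1: Evidence at protein level": 60_013,
    "PE 2: Evidence at transcript level": 63_043,
    "PE 3: Inferred from homology": 255_230,
    "PE 4: Predicted": 13_153,
    "PE 5: Uncertain": 1_228,
}

# Total number of sequence entries stated for release 56.0.
total_entries = 392_667

# The five classes should account for every entry exactly once.
pe_total = sum(pe_counts.values())
print("PE counts match release total:", pe_total == total_entries)

# Share of each PE level as a percentage of all entries.
shares = {label: 100 * n / total_entries for label, n in pe_counts.items()}
for label, pct in shares.items():
    print(f"{label}: {pct:.1f} %")
```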
2. AMINO ACID COMPOSITION
2.1 Composition in percent for the complete database
Ala (A) 8.13 Gln (Q) 3.95 Leu (L) 9.67 Ser (S) 6.67
Arg (R) 5.50 Glu (E) 6.73 Lys (K) 5.88 Thr (T) 5.35
Asn (N) 4.05 Gly (G) 7.04 Met (M) 2.41 Trp (W) 1.09
Asp (D) 5.40 His (H) 2.28 Phe (F) 3.88 Tyr (Y) 2.93
Cys (C) 1.42 Ile (I) 5.92 Pro (P) 4.77 Val (V) 6.82
Asx (B) 0.000 Glx (Z) 0.000 Xaa (X) 0.00
2.2 Classification of the amino acids by their frequency
Leu, Ala, Gly, Val, Glu, Ser, Ile, Lys, Arg, Asp, Thr, Pro, Asn, Gln,
Phe, Tyr, Met, His, Cys, Trp
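
The classification in section 2.2 is the list of the twenty standard amino acids sorted by the percentages in section 2.1, from most to least frequent. A minimal sketch of that derivation follows; the dictionary name `composition` is an assumption for illustration, and the values are copied from the composition table.

```python
# Percent composition for the complete database, copied from section 2.1.
composition = {
    "Ala": 8.13, "Gln": 3.95, "Leu": 9.67, "Ser": 6.67,
    "Arg": 5.50, "Glu": 6.73, "Lys": 5.88, "Thr": 5.35,
    "Asn": 4.05, "Gly": 7.04, "Met": 2.41, "Trp": 1.09,
    "Asp": 5.40, "His": 2.28, "Phe": 3.88, "Tyr": 2.93,
    "Cys": 1.42, "Ile": 5.92, "Pro": 4.77, "Val": 6.82,
}

# Sort the three-letter codes by descending frequency.
ranked = sorted(composition, key=composition.get, reverse=True)
print(", ".join(ranked))
```

All twenty percentages are distinct, so the ordering does not depend on how ties are broken.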
3. TAXONOMIC ORIGIN
Total number of species represented in this release of UniProtKB/Swiss-Prot: 11471
The first twenty species represent 98378 sequences: 25.1 % of the total
number of entries.
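
The share quoted for the first twenty species can be reproduced from the per-species counts listed in table 3.2 and the release total. This is an illustrative sketch; the list name `top20` is an assumption, and the counts are the first twenty frequencies from that table.

```python
# Frequencies of the twenty most represented species (table 3.2, ranks 1-20).
top20 = [
    20069, 15813, 7122, 6914, 6553, 5371, 4421, 4342, 3188, 2878,
    2817, 2816, 2479, 2194, 2125, 2054, 1950, 1782, 1774, 1716,
]

# Total number of sequence entries in release 56.0.
total_entries = 392_667

top20_total = sum(top20)
share = 100 * top20_total / total_entries
print(f"{top20_total} sequences: {share:.1f} % of the total")
```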
3.1 Table of the frequency of occurrence of species
Species represented 1x: 5236
2x: 1694
3x: 835
4x: 548
5x: 419
6x: 320
7x: 232
8x: 193
9x: 169
10x: 107
11- 20x: 516
21- 50x: 351
51-100x: 139
>100x: 712
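
The frequency bins above should add up to the total number of species stated at the start of section 3. The sketch below checks this and derives the share of species represented by a single entry; the dictionary name `species_bins` is an illustrative assumption, and the counts are copied from table 3.1.

```python
# Number of species per representation bin, copied from table 3.1.
species_bins = {
    "1x": 5236, "2x": 1694, "3x": 835, "4x": 548, "5x": 419,
    "6x": 320, "7x": 232, "8x": 193, "9x": 169, "10x": 107,
    "11-20x": 516, "21-50x": 351, "51-100x": 139, ">100x": 712,
}

# Every species falls into exactly one bin, so the bins sum to the species total.
total_species = sum(species_bins.values())
print("Total species:", total_species)

# Share of species that appear only once in the database.
singleton_share = 100 * species_bins["1x"] / total_species
print(f"Species represented once: {singleton_share:.1f} %")
```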
3.2 Table of the most represented species
------ --------- --------------------------------------------
Number Frequency Species
------ --------- --------------------------------------------
1 20069 Homo sapiens (Human)
2 15813 Mus musculus (Mouse)
3 7122 Rattus norvegicus (Rat)
4 6914 Arabidopsis thaliana (Mouse-ear cress)
5 6553 Saccharomyces cerevisiae (Baker's yeast)
6 5371 Bos taurus (Bovine)
7 4421 Schizosaccharomyces pombe (Fission yeast)
8 4342 Escherichia coli (strain K12)
9 3188 Caenorhabditis elegans
10 2878 Bacillus subtilis
11 2817 Drosophila melanogaster (Fruit fly)
12 2816 Xenopus laevis (African clawed frog)
13 2479 Dictyostelium discoideum (Slime mold)
14 2194 Danio rerio (Zebrafish) (Brachydanio rerio)
15 2125 Pongo abelii (Sumatran orangutan)
16 2054 Gallus gallus (Chicken)
17 1950 Escherichia coli O157:H7
18 1782 Methanocaldococcus jannaschii (Methanococcus jannaschii)
19 1774 Haemophilus influenzae
20 1716 Oryza sativa subsp. japonica (Rice)
21 1700 Salmonella typhimurium
22 1627 Escherichia coli O6
23 1625 Shigella flexneri
24 1445 Mycobacterium tuberculosis
25 1323 Sus scrofa (Pig)
26 1292 Salmonella typhi
27 1241 Pseudomonas aeruginosa
28 1187 Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
29 1183 Mycobacterium bovis
30 1121 Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey)
31 990 Synechocystis sp. (strain PCC 6803)
32 981 Archaeoglobus fulgidus
33 953 Yersinia pestis
34 912 Vibrio cholerae
35 909 Acanthamoeba polyphaga mimivirus (APMV)
36 888 Rhizobium meliloti (Sinorhizobium meliloti)
37 873 Oryctolagus cuniculus (Rabbit)
38 866 Salmonella paratyphi A
39 864 Staphylococcus aureus (strain Mu50 / ATCC 700699)
40 863 Staphylococcus aureus (strain N315)
41 835 Staphylococcus aureus (strain MW2)
42 835 Staphylococcus aureus (strain COL)
43 831 Staphylococcus aureus (strain MSSA476)
44 828 Staphylococcus aureus (strain MRSA252)
45 814 Salmonella choleraesuis
46 809 Yersinia pseudotuberculosis
47 808 Escherichia coli O6:K15:H31 (strain 536 / UPEC)
48 808 Shigella sonnei (strain Ss046)
49 765 Shigella boydii serotype 4 (strain Sb227)
50 764 Vibrio parahaemolyticus
51 763 Ashbya gossypii (Yeast) (Eremothecium gossypii)
52 759 Aquifex aeolicus
53 754 Pasteurella multocida
54 748 Shigella dysenteriae serotype 1 (strain Sd197)
55 747 Escherichia coli O9:H4 (strain HS)
56 747 Canis familiaris (Dog)
57 744 Escherichia coli (strain UTI89 / UPEC)
58 743 Escherichia coli O139:H28 (strain E24377A / ETEC)
59 736 Kluyveromyces lactis (Yeast) (Candida sphaerica)
60 727 Candida albicans (Yeast)
61 724 Erwinia carotovora subsp. atroseptica (Pectobacterium atrosepticum)
62 717 Neurospora crassa
63 711 Escherichia coli (strain ATCC 8739 / DSM 1576 / Crooks)
64 707 Streptomyces coelicolor
65 705 Vibrio vulnificus
66 700 Staphylococcus epidermidis (strain ATCC 35984 / RP62A)
67 699 Staphylococcus epidermidis (strain ATCC 12228)
68 694 Candida glabrata (Yeast) (Torulopsis glabrata)
69 692 Photorhabdus luminescens subsp. laumondii
70 689 Bacillus halodurans
71 688 Vibrio vulnificus (strain YJ016)
72 687 Mycoplasma pneumoniae
73 685 Shigella flexneri serotype 5b (strain 8401)
74 671 Pan troglodytes (Chimpanzee)
75 665 Bacillus anthracis
76 655 Yersinia pestis bv. Antiqua (strain Nepal516)
77 654 Anabaena sp. (strain PCC 7120)
78 650 Yersinia enterocolitica serotype O:8 / biotype 1B (strain 8081)
79 649 Yersinia pestis bv. Antiqua (strain Antiqua)
80 647 Mycobacterium leprae
81 639 Pseudomonas syringae pv. tomato
82 637 Pseudomonas putida (strain KT2440)
83 636 Yersinia pseudotuberculosis serotype O:1b (strain IP 31758)
84 630 Escherichia coli O1:K1 / APEC
85 627 Staphylococcus aureus (strain NCTC 8325)
86 620 Escherichia coli
87 618 Salmonella paratyphi B (strain ATCC BAA-1250 / SPB7)
88 617 Bradyrhizobium japonicum
89 613 Treponema pallidum
90 612 Enterobacter sp. (strain 638)
91 609 Zea mays (Maize)
92 599 Klebsiella pneumoniae subsp. pneumoniae (strain ATCC 700721 / MGH 78578)
93 598 Yersinia pestis (strain Pestoides F)
94 595 Methanobacterium thermoautotrophicum
95 592 Bacillus cereus (strain ATCC 14579 / DSM 31)
96 592 Agrobacterium tumefaciens (strain C58 / ATCC 33970)
97 589 Citrobacter koseri (strain ATCC BAA-895 / CDC 4225-83 / SGSC4696)
98 586 Ralstonia solanacearum (Pseudomonas solanacearum)
99 581 Shewanella oneidensis
100 581 Rickettsia prowazekii
101 580 Staphylococcus aureus (strain USA300)
102 579 Helicobacter pylori (Campylobacter pylori)
103 578 Rhizobium loti (Mesorhizobium loti)
104 575 Serratia proteamaculans (strain 568)
105 572 Buchnera aphidicola subsp. Acyrthosiphon pisum
106 569 Listeria monocytogenes
107 567 Staphylococcus aureus (strain bovine RF122 / ET3-1)
108 566 Lactococcus lactis subsp. lactis (Streptococcus lactis)
109 562 Buchnera aphidicola subsp. Schizaphis graminum
110 561 Listeria innocua
111 560 Photobacterium profundum (Photobacterium sp. (strain SS9))
112 560 Helicobacter pylori J99 (Campylobacter pylori J99)
113 559 Neisseria meningitidis serogroup B
114 556 Xanthomonas campestris pv. campestris
115 554 Salmonella arizonae (strain ATCC BAA-731 / CDC346-86 / RSK2980)
116 546 Staphylococcus haemolyticus (strain JCSC1435)
117 541 Staphylococcus saprophyticus subsp. saprophyticus
118 540 Neisseria meningitidis serogroup A
119 538 Brucella melitensis
120 535 Brucella suis
121 534 Bacillus cereus (strain ATCC 10987)
122 532 Yarrowia lipolytica (Candida lipolytica)
123 531 Clostridium acetobutylicum
124 529 Enterobacter sakazakii (strain ATCC BAA-894)
125 528 Caulobacter crescentus (Caulobacter vibrioides)
126 521 Emericella nidulans (Aspergillus nidulans)
127 521 Debaryomyces hansenii (Yeast) (Torulaspora hansenii)
128 521 Xanthomonas axonopodis pv. citri
129 515 Oceanobacillus iheyensis
130 514 Bacillus thuringiensis subsp. konkukian
131 509 Pseudomonas syringae pv. syringae (strain B728a)
132 507 Buchnera aphidicola subsp. Baizongia pistaciae
133 507 Streptococcus pneumoniae
134 504 Vibrio fischeri (strain ATCC 700601 / ES114)
135 503 Pseudomonas fluorescens (strain PfO-1)
136 502 Bacillus cereus (strain ZK / E33L)
137 502 Listeria monocytogenes serotype 4b (strain F2365)
138 501 Pseudomonas aeruginosa (strain UCBPP-PA14)
139 499 Xylella fastidiosa
140 498 Pseudomonas fluorescens (strain Pf-5 / ATCC BAA-477)
141 497 Thermotoga maritima
142 493 Bacillus licheniformis (strain DSM 13 / ATCC 14580)
143 493 Bordetella bronchiseptica (Alcaligenes bronchisepticus)
144 491 Rickettsia conorii
145 490 Xylella fastidiosa (strain Temecula1 / ATCC 700964)
146 488 Pseudomonas syringae pv. phaseolicola (strain 1448A / Race 6)
147 483 Mycoplasma genitalium
148 481 Bordetella parapertussis
149 481 Chromobacterium violaceum
150 481 Haemophilus ducreyi
151 480 Bordetella pertussis
152 478 Deinococcus radiodurans
153 475 Sodalis glossinidius (strain morsitans)
154 473 Clostridium perfringens
155 470 Corynebacterium glutamicum (Brevibacterium flavum)
156 467 Vibrio cholerae serotype O1 (strain ATCC 39541 / Ogawa 395 / O395)
157 464 Methanosarcina acetivorans
158 461 Brucella abortus
159 458 Haemophilus influenzae (strain 86-028NP)
160 456 Pyrococcus horikoshii
161 456 Mannheimia succiniciproducens (strain MBEL55E)
162 455 Pseudomonas entomophila (strain L48)
163 452 Pyrococcus abyssi
164 452 Streptomyces avermitilis
165 451 Xanthomonas campestris pv. campestris (strain 8004)
166 450 Burkholderia pseudomallei (Pseudomonas pseudomallei)
167 448 Pseudomonas aeruginosa (strain PA7)
168 448 Enterococcus faecalis (Streptococcus faecalis)
169 448 Halobacterium salinarium (Halobacterium halobium)
170 447 Bacillus clausii (strain KSM-K16)
171 446 Rickettsia felis (Rickettsia azadi)
172 444 Streptococcus pneumoniae (strain ATCC BAA-255 / R6)
173 444 Methanosarcina mazei (Methanosarcina frisia)
174 442 Shewanella sp. (strain MR-7)
175 441 Synechococcus elongatus (Thermosynechococcus elongatus)
176 441 Geobacillus kaustophilus
177 440 Lactobacillus plantarum
178 440 Vibrio harveyi (strain ATCC BAA-1116 / BB120)
179 439 Shewanella sp. (strain MR-4)
180 436 Streptococcus mutans
181 436 Chlamydia trachomatis
182 434 Thermoanaerobacter tengcongensis
183 434 Oryza sativa subsp. indica (Rice)
184 433 Rickettsia bellii (strain RML369-C)
185 433 Pyrococcus furiosus
186 432 Ovis aries (Sheep)
187 432 Synechococcus elongatus (strain PCC 7942) (Anacystis nidulans R2)
188 430 Brucella abortus (strain 2308)
189 429 Streptococcus pyogenes serotype M6
190 428 Acinetobacter sp. (strain ADP1)
191 427 Borrelia burgdorferi (Lyme disease spirochete)
192 427 Burkholderia mallei (Pseudomonas mallei)
193 427 Nicotiana tabacum (Common tobacco)
194 426 Rhodopseudomonas palustris
195 424 Anabaena variabilis (strain ATCC 29413 / PCC 7937)
196 423 Burkholderia sp. (strain 383) (Burkholderia cepacia
197 422 Campylobacter jejuni
198 421 Xanthomonas campestris pv. vesicatoria (strain 85-10)
199 420 Pseudomonas putida (strain F1 / ATCC 700007)
200 419 Chlamydia pneumoniae (Chlamydophila pneumoniae)
201 416 Ralstonia eutropha (strain JMP134) (Alcaligenes eutrophus)
202 414 Staphylococcus aureus (strain Newman)
203 414 Shewanella frigidimarina (strain NCIMB 400)
204 414 Aspergillus fumigatus (Sartorya fumigata)
205 413 Shewanella sp. (strain ANA-3)
206 412 Xanthomonas oryzae pv. oryzae (strain MAFF 311018)
207 412 Pseudomonas putida (strain GB-1)
208 410 Methylococcus capsulatus
209 409 Chlamydia muridarum
210 409 Streptococcus pyogenes serotype M1
211 408 Rhizobium sp. (strain NGR234)
212 408 Ralstonia eutropha (Cupriavidus necator
213 407 Sulfolobus solfataricus
214 405 Rhodobacter sphaeroides (strain ATCC 17023 / 2.4.1 / NCIB 8253 / DSM 158)
215 405 Streptococcus pyogenes serotype M18
216 403 Rickettsia typhi
217 403 Streptococcus pyogenes serotype M3
218 402 Bacillus amyloliquefaciens (strain FZB42)
219 400 Shewanella baltica (strain OS185)
220 400 Nitrosomonas europaea
221 398 Gloeobacter violaceus
222 398 Staphylococcus aureus (strain Mu3 / ATCC 700698)
223 397 Hahella chejuensis (strain KCTC 2396)
224 397 Solanum lycopersicum (Tomato) (Lycopersicon esculentum)
225 395 Aeromonas hydrophila subsp. hydrophila (strain ATCC 7966 / NCIB 9240)
226 395 Pseudoalteromonas haloplanktis (strain TAC 125)
227 393 Corynebacterium efficiens
228 392 Dechloromonas aromatica (strain RCB)
229 389 Neisseria gonorrhoeae (strain ATCC 700825 / FA 1090)
230 389 Chlorobium tepidum
231 389 Shewanella sp. (strain W3-18-1)
232 389 Colwellia psychrerythraea (strain 34H / ATCC BAA-681) (Vibrio psychroerythus)
233 388 Shewanella putrefaciens (strain CN-32 / ATCC BAA-453)
234 387 Burkholderia xenovorans (strain LB400)
235 385 Pseudomonas mendocina (strain ymp)
236 385 Synechococcus sp. (strain ATCC 27144 / PCC 6301 / SAUG 1402/1)
237 384 Mycobacterium paratuberculosis
238 384 Idiomarina loihiensis
239 382 Shewanella denitrificans (strain OS217 / ATCC BAA-1090 / DSM 15013)
240 382 Shewanella baltica (strain OS195)
241 381 Haemophilus influenzae (strain PittEE)
242 381 Synechococcus sp. (strain WH8102)
243 381 Pyrococcus kodakaraensis (Thermococcus kodakaraensis)
244 380 Burkholderia thailandensis (strain E264 / ATCC 700388 / DSM 13276 / CIP 106301)
245 380 Aeromonas salmonicida (strain A449)
246 379 Shewanella baltica (strain OS155 / ATCC BAA-1091)
247 377 Actinobacillus pleuropneumoniae serotype 5b (strain L20)
248 374 Solanum tuberosum (Potato)
249 374 Shewanella amazonensis (strain ATCC BAA-1098 / SB2B)
250 374 Burkholderia cenocepacia (strain AU 1054)
251 372 Prochlorococcus marinus (strain MIT 9313)
252 372 Azoarcus sp. (strain EbN1) (Aromatoleum aromaticum (strain EbN1))
253 372 Streptococcus agalactiae serotype III
254 371 Burkholderia pseudomallei (strain 1710b)
255 370 Xanthomonas oryzae pv. oryzae
256 369 Staphylococcus aureus (strain JH1)
257 369 Shewanella loihica (strain ATCC BAA-1088 / PV-4)
258 368 Streptococcus agalactiae serotype V
259 368 Coxiella burnetii
260 367 Methanopyrus kandleri
261 367 Listeria welshimeri serovar 6b (strain ATCC 35897 / DSM 20650 / SLCC5334)
262 365 Rhizobium etli (strain CFN 42 / ATCC 51251)
263 365 Bacillus cereus subsp. cytotoxis (strain NVH 391-98)
264 364 Prochlorococcus marinus
265 363 Staphylococcus aureus (strain JH9)
266 363 Leptospira interrogans
267 363 Geobacter sulfurreducens
268 357 Aeropyrum pernix
269 356 Haemophilus somnus (strain 129Pt) (Histophilus somni (strain 129Pt))
270 356 Nitrosococcus oceani (strain ATCC 19707 / NCIMB 11848)
271 355 Haemophilus influenzae (strain PittGG)
272 353 Leptospira interrogans serogroup Icterohaemorrhagiae serovar copenhageni
273 352 Burkholderia cenocepacia (strain HI2424)
274 352 Shewanella halifaxensis (strain HAW-EB4)
275 352 Thermus thermophilus (strain HB8 / ATCC 27634 / DSM 579)
276 351 Ralstonia metallidurans (strain CH34 / ATCC 43123 / DSM 2839)
277 351 Rhizobium leguminosarum bv. viciae (strain 3841)
278 351 Pisum sativum (Garden pea)
279 349 Legionella pneumophila (strain Paris)
280 348 Bacillus pumilus (strain SAFR-032)
281 348 Legionella pneumophila (strain Lens)
282 348 Chromohalobacter salexigens (strain DSM 3043 / ATCC BAA-138 / NCIMB 13768)
283 347 Sulfolobus tokodaii
284 346 Actinobacillus succinogenes (strain ATCC 55618 / 130Z)
285 345 Thiobacillus denitrificans (strain ATCC 25259)
286 345 Nocardia farcinica
287 345 Psychromonas ingrahamii (strain 37)
288 345 Shewanella pealeana (strain ATCC 700345 / ANG-SQ1)
289 345 Prochlorococcus marinus subsp. pastoris (strain CCMP1378 / MED4)
290 343 Glycine max (Soybean)
291 342 Mycobacterium tuberculosis (strain ATCC 25177 / H37Ra)
292 342 Neisseria meningitidis serogroup C / serotype 2a (strain ATCC 700532 / FAM18)
293 342 Legionella pneumophila subsp. pneumophila
294 340 Saccharophagus degradans (strain 2-40 / ATCC 43961 / DSM 17024)
295 339 Silicibacter pomeroyi
296 339 Desulfovibrio vulgaris (strain Hildenborough / ATCC 29579 / NCIMB 8303)
297 339 Burkholderia ambifaria (strain ATCC BAA-244 / AMMD) (Burkholderia cepacia
298 338 Pseudoalteromonas atlantica (strain T6c / BAA-1087)
299 338 Shewanella sediminis (strain HAW-EB3)
300 336 Macaca mulatta (Rhesus macaque)
301 332 Geobacillus thermodenitrificans (strain NG80-2)
302 331 Staphylococcus aureus (strain USA300 / TCH1516)
303 331 Caenorhabditis briggsae
304 331 Rhodopirellula baltica
305 330 Mycobacterium bovis (strain BCG / Pasteur 1173P2)
306 329 Burkholderia vietnamiensis (strain G4 / LMG 22486) (Burkholderia cepacia
307 329 Lactococcus lactis subsp. cremoris (strain MG1363)
308 329 Nitrosospira multiformis (strain ATCC 25196 / NCIMB 11849)
309 329 Bordetella avium (strain 197N)
310 328 Pseudomonas stutzeri (strain A1501)
311 328 Rhodoferax ferrireducens (strain DSM 15236 / ATCC BAA-621 / T118)
312 327 Symbiobacterium thermophilum
313 326 Zymomonas mobilis
314 326 Fusobacterium nucleatum subsp. nucleatum
315 324 Burkholderia pseudomallei (strain 1106a)
316 322 Clostridium perfringens (strain ATCC 13124 / NCTC 8237 / Type A)
317 322 Thermoplasma acidophilum
318 321 Thermus thermophilus (strain HB27 / ATCC BAA-163 / DSM 7039)
319 321 Wolinella succinogenes
320 321 Methanococcus maripaludis
321 321 Rhodospirillum rubrum (strain ATCC 11170 / NCIB 8255)
322 320 Alcanivorax borkumensis (strain SK2 / ATCC 700651 / DSM 11573)
323 319 Bacillus thuringiensis (strain Al Hakam)
324 319 Methylobacillus flagellatus (strain KT / ATCC 51484 / DSM 6875)
325 319 Geobacter metallireducens (strain GS-15 / ATCC 53774 / DSM 7210)
326 318 Triticum aestivum (Wheat)
327 318 Streptococcus agalactiae serotype Ia
328 318 Bacteroides thetaiotaomicron
329 317 Rhodopseudomonas palustris (strain HaA2)
330 316 Corynebacterium diphtheriae
331 316 Pelobacter carbinolicus (strain DSM 2380 / Gra Bd 1)
332 315 Burkholderia pseudomallei (strain 668)
333 315 Rhodopseudomonas palustris (strain BisB18)
334 315 Sinorhizobium medicae (strain WSM419) (Ensifer medicae)
335 315 Azoarcus sp. (strain BH72)
336 314 Marinobacter aquaeolei (Marinobacter hydrocarbonoclasticus
337 313 Clostridium tetani
338 313 Burkholderia mallei (strain NCTC 10247)
339 312 Methanosarcina barkeri (strain Fusaro / DSM 804)
340 312 Brucella canis (strain ATCC 23365 / NCTC 10854)
341 312 Brucella suis (strain ATCC 23445 / NCTC 10510)
342 311 Hordeum vulgare (Barley)
343 311 Campylobacter jejuni (strain RM1221)
344 311 Nitrobacter winogradskyi (strain Nb-255 / ATCC 25391)
345 310 Thiomicrospira crunogena (strain XCL-2)
346 309 Streptococcus pneumoniae serotype 2 (strain D39 / NCTC 7466)
347 309 Alkalilimnicola ehrlichei (strain MLHE-1)
348 308 Burkholderia mallei (strain NCTC 10229)
349 308 Prochlorococcus marinus (strain NATL2A)
350 305 Clostridium perfringens (strain SM101 / Type A)
351 305 Ochrobactrum anthropi (strain ATCC 49188 / DSM 6882 / NCTC 12168)
352 304 Sulfolobus acidocaldarius
353 304 Rhodopseudomonas palustris (strain BisB5)
354 303 Carboxydothermus hydrogenoformans (strain Z-2901 / DSM 6008)
355 302 Haloarcula marismortui (Halobacterium marismortui)
356 302 Bacteroides fragilis
357 301 Nitrobacter hamburgensis (strain X14 / DSM 10229)
358 300 Burkholderia mallei (strain SAVP1)
359 300 Gluconobacter oxydans (Gluconobacter suboxydans)
360 300 Mesorhizobium sp. (strain BNC1)
361 300 Streptococcus thermophilus (strain CNRZ 1066)
362 298 Roseobacter denitrificans (strain ATCC 33942 / OCh 114) (Erythrobacter sp.
363 298 Streptococcus thermophilus (strain ATCC BAA-250 / LMG 18311)
364 297 Synechococcus sp. (strain CC9902)
365 297 Cryptococcus neoformans (Filobasidiella neoformans)
366 297 Prochlorococcus marinus (strain MIT 9312)
367 295 Staphylococcus aureus
368 295 Bartonella henselae (Rochalimaea henselae)
369 295 Psychrobacter arcticus (strain DSM 17307 / 273-4)
370 294 Pyrobaculum aerophilum
371 294 Nitrosomonas eutropha (strain C91)
372 293 Cavia porcellus (Guinea pig)
373 293 Helicobacter hepaticus
374 291 Lactococcus lactis subsp. cremoris (strain SK11)
375 290 Streptococcus sanguinis (strain SK36)
376 290 Desulfotalea psychrophila
377 289 Streptococcus gordonii (strain Challis / ATCC 35105 / CH1 / DL1 / V288)
378 289 Legionella pneumophila (strain Corby)
379 289 Synechococcus sp. (strain JA-3-3Ab)
380 289 Thermoplasma volcanium
381 289 Bartonella quintana (Rochalimaea quintana)
382 288 Synechococcus sp. (strain CC9605)
383 288 Synechococcus sp. (strain JA-2-3B'a(2-13))
384 287 Moorella thermoacetica (strain ATCC 39073)
385 286 Brucella ovis (strain ATCC 25840 / 63/290 / NCTC 10512)
386 286 Streptococcus pyogenes serotype M28
387 286 Psychrobacter cryohalolentis (strain K5)
388 286 Halorhodospira halophila (strain DSM 244 / SL1) (Ectothiorhodospira halophila
389 285 Pseudomonas putida
390 284 Jannaschia sp. (strain CCS1)
391 284 Streptococcus pyogenes serotype M5 (strain Manfredo)
392 282 Rhodopseudomonas palustris (strain BisA53)
393 282 Haemophilus somnus (strain 2336) (Histophilus somni (strain 2336))
394 282 Lactobacillus sakei subsp. sakei (strain 23K)
395 281 Rhodobacter sphaeroides (strain ATCC 17029 / ATH 2.4.9)
396 280 Trichodesmium erythraeum (strain IMS101)
397 280 Silicibacter sp. (strain TM1040)
398 280 Bifidobacterium longum
399 279 Ustilago maydis (Smut fungus)
400 279 Streptococcus thermophilus (strain ATCC BAA-491 / LMD-9)
401 279 Wigglesworthia glossinidia brevipalpis
402 278 Spinacia oleracea (Spinach)
403 277 Campylobacter jejuni subsp. jejuni serotype O:23/36 (strain 81-176)
404 277 Bradyrhizobium sp. (strain BTAi1 / ATCC BAA-1182)
405 276 Lactobacillus johnsonii
406 275 Campylobacter jejuni subsp. jejuni serotype O:6 (strain 81116 / NCTC 11828)
407 275 Porphyromonas gingivalis (Bacteroides gingivalis)
408 274 Equus caballus (Horse)
409 274 Propionibacterium acnes
410 272 Gorilla gorilla gorilla (Lowland gorilla)
411 272 Polaromonas sp. (strain JS666 / ATCC BAA-500)
412 272 Leifsonia xyli subsp. xyli
413 270 Bacteroides fragilis (strain ATCC 25285 / NCTC 9343)
414 269 Francisella tularensis subsp. tularensis
415 269 Bradyrhizobium sp. (strain ORS278)
416 269 Clostridium botulinum (strain Langeland / NCTC 10281 / Type F)
417 269 Aspergillus oryzae
418 268 Blochmannia floridanus
419 268 Rhodococcus sp. (strain RHA1)
420 268 Bacteriophage T4
421 268 Desulfovibrio desulfuricans (strain G20)
422 268 Acidovorax avenae subsp. citrulli (strain AAC00-1)
423 267 Helicobacter pylori (strain HPAG1)
424 267 Anaeromyxobacter dehalogenans (strain 2CP-C)
425 266 Magnetospirillum magneticum (strain AMB-1 / ATCC 700264)
426 265 Lactobacillus acidophilus
427 265 Clostridium novyi (strain NT)
428 264 Janthinobacterium sp. (strain Marseille) (Minibacterium massiliensis)
429 264 Mycobacterium ulcerans (strain Agy99)
430 264 Chlorobium chlorochromatii (strain CaD3)
431 263 Ureaplasma parvum (Ureaplasma urealyticum biotype 1)
432 263 Neisseria meningitidis serogroup C (strain 053442)
433 262 Rhodobacter capsulatus (Rhodopseudomonas capsulata)
434 262 Paracoccus denitrificans (strain Pd 1222)
435 262 Streptococcus pyogenes serotype M12 (strain MGAS9429)
436 261 Streptococcus pyogenes serotype M4 (strain MGAS10750)
437 260 Corynebacterium glutamicum (strain R)
438 260 Desulfitobacterium hafniense (strain Y51)
439 260 Chlamydophila caviae
440 258 Streptococcus pyogenes serotype M2 (strain MGAS10270)
441 258 Polaromonas naphthalenivorans (strain CJ2)
442 257 Myxococcus xanthus (strain DK 1622)
443 257 Clostridium beijerinckii (strain ATCC 51743 / NCIMB 8052)
444 257 Francisella tularensis subsp. holarctica (strain LVS)
445 257 Prochlorococcus marinus (strain MIT 9301)
446 257 Mycobacterium smegmatis (strain ATCC 700084 / mc(2)155)
447 257 Synechococcus sp. (strain CC9311)
448 256 Thermotoga petrophila (strain RKU-1 / ATCC BAA-488 / DSM 13995)
449 256 Herminiimonas arsenicoxydans
450 256 Pelodictyon luteolum (strain DSM 273) (Chlorobium luteolum (strain DSM 273))
451 255 Acidovorax sp. (strain JS42)
452 255 Clostridium thermocellum (strain ATCC 27405 / DSM 1237)
453 255 Prochlorococcus marinus (strain MIT 9515)
454 255 Synechococcus sp. (strain WH7803)
455 255 Mycobacterium avium (strain 104)
456 254 Clostridium botulinum (strain ATCC 19397 / Type A)
457 254 Vaccinia virus (strain Copenhagen) (VACV)
458 253 Thermobifida fusca (strain YX)
459 253 Corynebacterium jeikeium (strain K411)
460 253 Novosphingobium aromaticivorans (strain DSM 12444)
461 252 Prochlorococcus marinus (strain AS9601)
462 252 Mycobacterium vanbaalenii (strain DSM 7251 / PYR-1)
463 251 Mycobacterium sp. (strain MCS)
464 250 Lactobacillus salivarius subsp. salivarius (strain UCC118)
465 250 Bdellovibrio bacteriovorus
466 249 Rhodobacter sphaeroides (strain ATCC 17025 / ATH 2.4.3)
467 248 Methylibium petroleiphilum (strain PM1)
468 248 Clostridium kluyveri (strain ATCC 8527 / DSM 555 / NCIMB 10680)
469 248 Campylobacter jejuni subsp. doylei (strain ATCC BAA-1458 / RM4099 / 269.97)
470 247 Alkaliphilus metalliredigens (strain QYMF)
471 246 Blochmannia pennsylvanicus (strain BPEN)
472 246 Prochlorococcus marinus (strain NATL1A)
473 246 Marinomonas sp. (strain MWYL1)
474 245 Prochlorococcus marinus (strain MIT 9215)
475 245 Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / ORS 571)
476 244 Coxiella burnetii (strain Dugway 5J108-111)
477 244 Sulfurimonas denitrificans (Thiomicrospira denitrificans
478 244 Coxiella burnetii (strain RSA 331 / Henzerling II)
479 244 Streptococcus pyogenes serotype M12 (strain MGAS2096)
480 244 Geobacter uraniireducens (strain Rf4) (Geobacter uraniumreducens)
481 243 Mycobacterium sp. (strain KMS)
482 243 Clostridium difficile (strain 630)
483 242 Francisella tularensis subsp. tularensis (strain FSC 198)
484 241 Mycobacterium sp. (strain JLS)
485 241 Desulfovibrio vulgaris subsp. vulgaris (strain DP4)
486 240 Lactobacillus casei (strain ATCC 334)
487 240 Prochlorococcus marinus (strain MIT 9303)
488 239 Francisella tularensis subsp. novicida (strain U112)
489 238 Treponema denticola
490 237 Acaryochloris marina (strain MBIC 11017)
491 237 Bacillus stearothermophilus (Geobacillus stearothermophilus)
492 237 Francisella tularensis subsp. holarctica (strain OSU18)
493 236 Baumannia cicadellinicola subsp. Homalodisca coagulata
494 235 Clostridium botulinum (strain Hall / ATCC 3502 / NCTC 13319 / Type A)
495 235 Natronomonas pharaonis (strain DSM 2160 / ATCC 35678)
496 235 Syntrophus aciditrophicus (strain SB)
497 234 Sphingopyxis alaskensis (Sphingomonas alaskensis)
498 234 Methanococcus vannielii (strain SB / ATCC 35089 / DSM 1224)
499 234 Leptospira borgpetersenii serovar Hardjo-bovis (strain JB197)
500 233 Hyphomonas neptunium (strain ATCC 15444)
501 232 Pediococcus pentosaceus (strain ATCC 25745 / 183-1w)
502 232 Methanococcus maripaludis (strain C7 / ATCC BAA-1331)
503 232 Chlorobium phaeobacteroides (strain DSM 266)
504 231 Chlamydomonas reinhardtii
505 231 Verminephrobacter eiseniae (strain EF01-2)
506 230 Pelobacter propionicus (strain DSM 2379)
507 230 Alkaliphilus oremlandii (strain OhILAs) (Clostridium oremlandii (strain OhILAs))
508 229 Helicobacter acinonychis (strain Sheeba)
509 229 Methanococcus maripaludis (strain C5 / ATCC BAA-1333)
510 229 Maricaulis maris (strain MCS10)
511 229 Deinococcus geothermalis (strain DSM 11300)
512 226 Chlamydia trachomatis (strain A/HAR-13 / ATCC VR-571B)
513 226 Francisella tularensis subsp. tularensis (strain WY96-3418)
514 225 Protochlamydia amoebophila (strain UWE25)
515 224 Cricetulus griseus (Chinese hamster)
516 223 Desulfotomaculum reducens (strain MI-1)
517 223 Francisella tularensis subsp. holarctica (strain FTA)
518 223 Syntrophomonas wolfei subsp. wolfei (strain Goettingen)
519 222 Dinoroseobacter shibae (strain DFL 12)
520 221 Frankia sp. (strain CcI3)
521 221 Caulobacter sp. (strain K31)
522 220 Syntrophobacter fumaroxidans (strain DSM 10017 / MPOB)
523 220 Lactobacillus brevis (strain ATCC 367 / JCM 1170)
524 219 Synechococcus sp. (strain RCC307)
525 219 Bartonella tribocorum (strain CIP 105476 / IBS 506)
526 218 Lactobacillus delbrueckii subsp. bulgaricus (strain ATCC 11842 / DSM 20081)
527 218 Chlamydophila abortus
528 217 Felis silvestris catus (Cat)
529 217 Porphyra purpurea
530 217 Leptospira borgpetersenii serovar Hardjo-bovis (strain L550)
531 217 Bartonella bacilliformis (strain ATCC 35685 / KC583)
532 217 Methanococcoides burtonii (strain DSM 6242)
533 216 Dehalococcoides sp. (strain CBDB1)
534 215 Dehalococcoides ethenogenes (strain 195)
535 215 Rickettsia akari (strain Hartford)
536 214 Klebsiella pneumoniae
537 212 Granulibacter bethesdensis (strain ATCC BAA-1260 / CGDNIH1)
538 212 Parvibaculum lavamentivorans (strain DS-1 / DSM 13023 / NCIMB 13966)
539 211 Rickettsia canadensis (strain McKiel)
540 210 Mycobacterium gilvum (strain PYR-GCK) (Mycobacterium flavescens
541 210 Francisella philomiragia subsp. philomiragia (strain ATCC 25017)
542 210 Anaeromyxobacter sp. (strain Fw109-5)
543 210 Rickettsia rickettsii (strain Sheila Smith)
544 210 Bacteroides vulgatus (strain ATCC 8482 / DSM 1447 / NCTC 11154)
545 209 Gibberella zeae (Fusarium graminearum)
546 209 Streptococcus suis (strain 98HAH33)
547 208 Nitratiruptor sp. (strain SB155-2)
548 208 Porphyra yezoensis
549 208 Caldicellulosiruptor saccharolyticus (strain ATCC 43494 / DSM 8903)
550 207 Pelagibacter ubique
551 206 Magnetococcus sp. (strain MC-1)
552 206 Mesocricetus auratus (Golden hamster)
553 206 Salinibacter ruber (strain DSM 13855)
554 206 Prosthecochloris vibrioformis (Chlorobium vibrioforme subsp. thiosulfatophilum (Chlorobium phaeovibrioides
555 204 Chlamydophila felis (strain Fe/C-56)
556 204 Lactobacillus delbrueckii subsp. bulgaricus (strain ATCC BAA-365)
557 204 Psychrobacter sp. (strain PRwf-1)
558 203 Encephalitozoon cuniculi
559 203 Tropheryma whipplei (strain TW08/27) (Whipple's bacillus)
560 202 Tropheryma whipplei (strain Twist) (Whipple's bacillus)
561 202 Parabacteroides distasonis (strain ATCC 8503 / DSM 20701 / NCTC 11152)
562 202 Lactobacillus reuteri (strain ATCC 23272 / DSM 20016 / F275)
563 201 Acidiphilium cryptum (strain JF-5)
564 201 Sphingomonas wittichii (strain RW1 / DSM 6014 / JCM 10273)
565 201 Vaccinia virus (strain Western Reserve / WR) (VACV)
566 201 Acidobacteria bacterium (strain Ellin345)
567 201 Rubrobacter xylanophilus (strain DSM 9941 / NBRC 16129)
568 200 Picrophilus torridus
569 200 Saccharopolyspora erythraea (strain NRRL 23338)
3.3 Taxonomic distribution of the sequences
Kingdom sequences (% of the database)
Archaea 14694 ( 4%)
Bacteria 224003 ( 57%)
Eukaryota 141583 ( 36%)
Viruses 12387 ( 3%)
Within Eukaryota:
Category sequences (% of Eukaryota) (% of the complete database)
Human 20070 ( 14%) ( 5%)
Other Mammalia 42975 ( 30%) ( 11%)
Other Vertebrata 13982 ( 10%) ( 4%)
Viridiplantae 23475 ( 17%) ( 6%)
Fungi 21941 ( 15%) ( 6%)
Insecta 5528 ( 4%) ( 1%)
Nematoda 3765 ( 3%) ( 1%)
Other 9847 ( 7%) ( 3%)
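The percentages in the kingdom table can be reproduced from the raw counts. The following is a minimal sketch, assuming each percentage is the category count divided by the sum of all four kingdom counts and rounded to a whole number; the function and variable names are illustrative and not part of any UniProt tool.

```python
# Kingdom counts copied from the table above (UniProtKB/Swiss-Prot release 56.0).
kingdom_counts = {
    "Archaea": 14694,
    "Bacteria": 224003,
    "Eukaryota": 141583,
    "Viruses": 12387,
}


def percent_shares(counts):
    """Return each category's share of the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {name: round(100 * n / total) for name, n in counts.items()}


# Assumption: the release total equals the sum of the four kingdom counts.
total_entries = sum(kingdom_counts.values())
shares = percent_shares(kingdom_counts)

for name, pct in shares.items():
    print(f"{name:<10} {kingdom_counts[name]:>7} ({pct:>3}%)")
```

The same function can be applied to the "Within Eukaryota" counts, once with the Eukaryota total and once with the database total as the denominator.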
4. SEQUENCE SIZE
Distribution of the sequences by size (excluding fragments)
From To Number From To Number
1- 50 6645 1001-1100 2928
51- 100 29356 1101-1200 1993
101- 150 41958 1201-1300 1582
151- 200 41301 1301-1400 1428
201- 250 41103 1401-1500 1115
251- 300 36011 1501-1600 556
301- 350 35089 1601-1700 435
351- 400 30808 1701-1800 380
401- 450 25341 1801-1900 352
451- 500 21000 1901-2000 282
501- 550 14954 2001-2100 178
551- 600 10904 2101-2200 246
601- 650 9329 2201-2300 244
651- 700 6517 2301-2400 164
701- 750 5321 2401-2500 113
751- 800 3918 >2500 860
801- 850 3418
851- 900 3641
901- 950 2986
951-1000 2114
The average sequence length in UniProtKB/Swiss-Prot is 359 amino acids.
The shortest sequence is GWA_SEPOF (P83570): 2 amino acids.
The longest sequence is TITIN_MOUSE (A2ASS6): 35213 amino acids.
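The size classes used in the table above are 50 residues wide up to 1000, 100 residues wide from 1001 to 2500, and open-ended above 2500. As a rough sketch of how a sequence length could be assigned to one of these classes (the helper names `size_bin` and `size_label` and the sample lengths are hypothetical, not taken from the release tooling):

```python
from collections import Counter


def size_bin(length):
    """Return the (low, high) size class for a length; high is None for >2500."""
    if length < 1:
        raise ValueError("sequence length must be at least 1")
    if length > 2500:
        return (2501, None)
    # Classes are 50 wide up to 1000 residues and 100 wide from 1001 to 2500.
    width = 50 if length <= 1000 else 100
    low = ((length - 1) // width) * width + 1
    return (low, low + width - 1)


def size_label(length):
    """Format a size class the way the table labels it."""
    low, high = size_bin(length)
    return ">2500" if high is None else f"{low}-{high}"


# Illustrative lengths only; 2 and 35213 are the shortest and longest
# sequence lengths quoted in this section.
lengths = [2, 50, 51, 359, 1000, 1001, 2500, 35213]
histogram = Counter(size_label(n) for n in lengths)

for label, count in histogram.items():
    print(f"{label:>10} {count}")
```

Computing the lower bound from `length - 1` keeps boundary values such as 50 and 1000 in the class that ends on them, which matches the inclusive ranges in the table.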
5. JOURNAL CITATIONS
Note: the following citation statistics reflect the number of distinct
journal citations.
Total number of journals cited in this release of UniProtKB/Swiss-Prot: 1930
5.1 Table of the frequency of journal citations
Journals cited 1x: 630
2x: 266
3x: 133
4x: 100
5x: 73
6x: 54
7x: 44
8x: 38
9x: 34
10x: 24
11- 20x: 150
21- 50x: 150
51-100x: 91
>100x: 143
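As a consistency check, the frequency classes above should add up to the total number of journals cited. A short sketch, with the counts copied from the table and an illustrative dictionary name:

```python
# Journal counts per citation-frequency class, copied from the table in 5.1.
journals_by_citation_class = {
    "1x": 630,
    "2x": 266,
    "3x": 133,
    "4x": 100,
    "5x": 73,
    "6x": 54,
    "7x": 44,
    "8x": 38,
    "9x": 34,
    "10x": 24,
    "11-20x": 150,
    "21-50x": 150,
    "51-100x": 91,
    ">100x": 143,
}

total_journals = sum(journals_by_citation_class.values())
print(total_journals)  # 1930, the total stated at the start of section 5
```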
5.2 List of the most cited journals in UniProtKB/Swiss-Prot
Nb Citations Journal name
-- --------- -------------------------------------------------------------
1 16362 Journal of Biological Chemistry
2 7669 Proceedings of the National Academy of Sciences of the U.S.A.
3 4700 Journal of Bacteriology
4 4405 Gene
5 4201 Biochemical and Biophysical Research Communications
6 4182 Nucleic Acids Research
7 3749 FEBS Letters
8 3504 Biochemistry
9 3483 The EMBO Journal
10 3128 Molecular and Cellular Biology
11 3010 European Journal of Biochemistry
12 2973 Nature
13 2831 Biochimica et Biophysica Acta
14 2713 Journal of Molecular Biology
15 2434 Genomics
16 2419 Cell
17 2020 Biochemical Journal
18 1893 Science
19 1629 Journal of Virology
20 1587 Molecular Microbiology
21 1431 Journal of Cell Biology
22 1427 Plant Molecular Biology
23 1290 Molecular and General Genetics
24 1232 Virology
25 1208 Nature Genetics
26 1201 Genes and Development
27 1196 Human Molecular Genetics
28 1122 Journal of Biochemistry
29 1109 Plant Physiology
30 1108 Oncogene
31 1104 The American Journal of Human Genetics
32 985 Development
33 922 Journal of Immunology
34 907 Human Mutation
35 869 Genetics
36 850 Molecular Biology of the Cell
37 816 Infection and Immunity
38 803 Structure
39 772 Journal of General Virology
40 757 Archives of Biochemistry and Biophysics
41 723 Yeast
42 718 The Plant Cell
43 701 Blood
44 672 Microbiology
45 651 Molecular Cell
46 617 Developmental Biology
47 611 Journal of Cell Science
48 600 FEMS Microbiology Letters
49 598 Cancer Research
50 597 The Plant Journal
51 564 Human Genetics
52 564 Nature Structural Biology
53 533 Mechanisms of Development
54 525 Current Biology
55 511 Current Genetics
56 477 Applied and Environmental Microbiology
57 476 Journal of Neuroscience
58 467 Acta Crystallographica, Section D
59 466 Journal of Clinical Investigation
60 463 Protein Science
61 462 Neuron
62 460 Mammalian Genome
63 423 Immunogenetics
64 421 The Journal of Experimental Medicine
65 420 Toxicon
66 415 Molecular Endocrinology
67 410 Molecular and Biochemical Parasitology
68 408 American Journal of Physiology
69 379 Journal of Neurochemistry
70 365 Endocrinology
71 360 Journal of Molecular Evolution
72 354 DNA and Cell Biology
73 351 The Journal of Clinical Endocrinology and Metabolism
74 346 DNA Sequence
75 332 Molecular Biology and Evolution
76 315 Bioscience, Biotechnology, and Biochemistry
77 307 Journal of Medical Genetics
78 306 Brain Research. Molecular Brain Research
79 286 Biological Chemistry Hoppe-Seyler
80 280 Proteins
81 272 Cytogenetics and Cell Genetics
82 261 Comparative Biochemistry and Physiology
83 260 Peptides
84 256 Journal of Investigative Dermatology
85 251 Antimicrobial Agents and Chemotherapy
86 245 Journal of General Microbiology
87 245 Molecular Pharmacology
88 240 Biology of Reproduction
89 239 Plant and Cell Physiology
90 239 Nature Cell Biology
91 233 Experimental Cell Research
92 225 Genome Research
93 215 Hoppe-Seyler's Zeitschrift fur Physiologische Chemie
94 213 Virus Research
95 210 Neurology
96 197 Developmental Dynamics
97 194 Molecular Plant-Microbe Interactions
98 193 RNA
99 191 DNA Research
100 188 European Journal of Immunology
101 185 Biochimie
102 181 Tissue Antigens
103 175 Annals of Neurology
104 174 European Journal of Human Genetics
105 168 Planta
106 167 Journal of Human Genetics
107 166 Genes to Cells
108 163 Molecular and Cellular Endocrinology
109 163 Immunity
110 163 Developmental Cell
111 159 DNA
112 155 Molecular Phylogenetics and Evolution
113 154 American Journal of Medical Genetics
114 152 Hemoglobin
115 150 Archives of Microbiology
116 150 Eukaryotic Cell
117 148 The New England Journal of Medicine
118 147 Insect Biochemistry and Molecular Biology
119 146 Bioorganicheskaia Khimiia
120 139 Investigative Ophthalmology and Visual Science
121 137 Molecular Reproduction and Development
122 136 Diabetes
123 134 Glycobiology
124 134 Animal Genetics
125 132 Molecular Immunology
126 129 General and Comparative Endocrinology
127 128 Molecular and Cellular Neuroscience
128 125 International Journal of Cancer
129 121 Archives of Virology
130 119 Agricultural and Biological Chemistry
131 116 The FASEB Journal
132 112 British Journal of Haematology
133 112 EMBO Reports
134 111 Molecular Genetics and Metabolism
135 111 Clinical Genetics
136 110 Journal of Protein Chemistry
137 108 Biological Chemistry
138 106 Molecular Genetics and Genomics
139 106 Journal of Cellular Biochemistry
140 105 Journal of Neuroscience Research
141 104 Neuroscience Letters
142 103 Journal of Molecular Endocrinology
143 103 Journal of Lipid Research
144 100 Biochemistry and Molecular Biology International
6. STATISTICS FOR SOME LINE TYPES
The following table summarizes the total number of some UniProtKB/Swiss-Prot lines,
as well as the number of entries with at least one such line, and the
frequency of the lines.
Total Number of Average
Line type / subtype number entries per entry
--------------------------------- -------- --------- ---------
References (RL) 716052 1.82
1 Journal 584653 309924 1.49
2 Submitted to EMBL/GenBank/DDBJ 124370 114305 0.32
3 Submitted to other databases 5069 4680 0.01
4 Book citation 594 584 <0.01
5 Plant Gene Register 543 531 <0.01
6 Thesis 389 387 <0.01
7 Unpublished observations 287 283 <0.01
8 Patent 141 139 <0.01
9 Worm Breeder's Gazette 6 6 <0.01
Total number of distinct authors cited in UniProtKB/Swiss-Prot: 263407.
Comments (CC) 1625064 4.14
1 SIMILARITY 455111 369189 1.16
2 FUNCTION 281813 271302 0.72
3 SUBCELLULAR LOCATION 225139 220959 0.57
4 CATALYTIC ACTIVITY 157277 143739 0.40
5 SUBUNIT 154763 154763 0.39
6 PATHWAY 91738 79969 0.23
7 COFACTOR 65345 59933 0.17
8 TISSUE SPECIFICITY 29543 29543 0.08
9 PTM 29031 23754 0.07
10 MISCELLANEOUS 26924 24573 0.07
11 DOMAIN 24285 21420 0.06
12 ALTERNATIVE PRODUCTS 16919 16919 0.04
13 SEQUENCE CAUTION 10382 10382 0.03
14 INTERACTION 9471 9471 0.02
15 INDUCTION 9204 9204 0.02
16 DEVELOPMENTAL STAGE 7584 7584 0.02
17 WEB RESOURCE 6317 5139 0.02
18 ENZYME REGULATION 6276 6276 0.02
19 CAUTION 5356 5249 0.01
20 DISEASE 4375 3018 0.01
21 MASS SPECTROMETRY 3571 2713 0.01
22 BIOPHYSICOCHEMICAL PROPERTIES 2236 2236 0.01
23 POLYMORPHISM 718 688 <0.01
24 RNA EDITING 544 544 <0.01
25 ALLERGEN 447 447 <0.01
26 TOXIC DOSE 379 371 <0.01
27 BIOTECHNOLOGY 236 234 <0.01
28 PHARMACEUTICAL 80 80 <0.01
Features (FT) 2470799 6.29
1 CHAIN 398905 388707 1.02
2 TRANSMEM 269851 55094 0.69
3 METAL 179485 44921 0.46
4 BINDING 127583 40359 0.32
5 DOMAIN 118825 68530 0.30
6 CONFLICT 108345 37614 0.28
7 STRAND 106813 10124 0.27
8 MOD_RES 104497 37238 0.27
9 TOPO_DOM 104334 21254 0.27
10 HELIX 103676 10650 0.26
11 ACT_SITE 94617 56024 0.24
12 CARBOHYD 86991 22400 0.22
13 DISULFID 85328 21608 0.22
14 REPEAT 72946 11107 0.19
15 NP_BIND 70900 48718 0.18
16 VARIANT 60913 12807 0.16
17 REGION 60758 33829 0.15
18 COMPBIAS 37679 21541 0.10
19 VAR_SEQ 35538 15081 0.09
20 SIGNAL 29376 29366 0.07
21 MOTIF 25878 16782 0.07
22 TURN 25694 8574 0.07
23 SITE 25012 14433 0.06
24 ZN_FING 24643 9992 0.06
25 MUTAGEN 24327 5884 0.06
26 COILED 14972 9908 0.04
27 INIT_MET 12351 12351 0.03
28 NON_TER 10933 8359 0.03
29 LIPID 9425 6043 0.02
30 PROPEP 9382 7808 0.02
31 DNA_BIND 8799 8132 0.02
32 PEPTIDE 7590 4655 0.02
33 TRANSIT 5616 5533 0.01
34 CA_BIND 3347 1388 0.01
35 CROSSLNK 3031 2096 0.01
36 NON_CONS 1432 581 <0.01
37 UNSURE 667 223 <0.01
38 NON_STD 340 266 <0.01
Cross-references (DR) 6953477 17.71
1 InterPro 954161 365778 2.43
2 EMBL 678519 383792 1.73
3 GO 659518 261671 1.68
4 Pfam 505996 353567 1.29
5 PROSITE 356139 223270 0.91
6 RefSeq 355438 325205 0.91
7 GeneID 341505 324984 0.87
8 KEGG 300528 280532 0.77
9 GenomeReviews 256332 238769 0.65
10 HAMAP 205008 204908 0.52
11 HOGENOM 198402 198399 0.51
12 TIGRFAMs 185636 173805 0.47
13 Gene3D 180250 149065 0.46
14 BioCyc 145962 139440 0.37
15 PANTHER 143138 132190 0.36
16 PRINTS 123838 101263 0.32
17 NMPDR 117067 117064 0.30
18 PIR 110967 101279 0.28
19 ProDom 109281 106447 0.28
20 SMART 104656 79501 0.27
21 HSSP 83910 83910 0.21
22 UniGene 78433 72798 0.20
23 HOVERGEN 75109 75109 0.19
24 Ensembl 66712 65180 0.17
25 PIRSF 58210 58210 0.15
26 ArrayExpress 53103 53103 0.14
27 PDBsum 52185 13136 0.13
28 PDB 52185 13136 0.13
29 SMR 49807 49807 0.13
30 GermOnline 41973 41363 0.11
31 TIGR 31613 30912 0.08
32 CleanEx 30182 29548 0.08
33 HGNC 18843 18702 0.05
34 LinkHub 18105 18105 0.05
35 IntAct 16471 16471 0.04
36 PhosphoSite 15991 15991 0.04
37 PharmGKB 15825 15815 0.04
38 MGI 15680 15629 0.04
39 MIM 15171 12072 0.04
40 H-InvDB 11260 9566 0.03
41 DIP 9000 8950 0.02
42 MEROPS 7206 6910 0.02
43 RGD 6999 6994 0.02
44 TAIR 6998 6884 0.02
45 SGD 6640 6538 0.02
46 CYGD 6628 6523 0.02
47 HPA 5789 4704 0.01
48 DrugBank 5326 1627 0.01
49 PeptideAtlas 5168 5168 0.01
50 GeneDB_Spombe 4460 4419 0.01
51 EcoGene 4331 4328 0.01
52 EchoBASE 4159 4124 0.01
53 WormPep 3884 3180 0.01
54 FlyBase 3692 3564 0.01
55 Gramene 3681 3681 0.01
56 WormBase 3578 3494 0.01
57 Reactome 3416 2069 0.01
58 SubtiList 2819 2818 0.01
59 Orphanet 2633 1673 0.01
60 dictyBase 2568 2478 0.01
61 GeneFarm 2252 2231 0.01
62 ZFIN 2105 2089 0.01
63 StyGene 1653 1649 <0.01
64 TubercuList 1473 1437 <0.01
65 SWISS-2DPAGE 1182 1182 <0.01
66 PseudoCAP 1180 1171 <0.01
67 ListiList 1131 1123 <0.01
68 REPRODUCTION-2DPAGE 1029 941 <0.01
69 AGD 769 763 <0.01
70 LegioList 699 697 <0.01
71 PhotoList 692 692 <0.01
72 Leproma 650 647 <0.01
73 PeroxiBase 503 492 <0.01
74 World-2DPAGE 495 495 <0.01
75 CGD 471 471 <0.01
76 MaizeGDB 468 463 <0.01
77 ProMEX 423 423 <0.01
78 DisProt 397 394 <0.01
79 OGP 378 378 <0.01
80 SagaList 373 372 <0.01
81 REBASE 351 343 <0.01
82 ECO2DBASE 351 299 <0.01
83 GlycoSuiteDB 282 282 <0.01
84 BuruList 264 264 <0.01
85 PHCI-2DPAGE 244 244 <0.01
86 VectorBase 236 229 <0.01
87 BindingDB 210 210 <0.01
88 MypuList 198 198 <0.01
89 DOSAC-COBS-2DPAGE 150 150 <0.01
90 Aarhus/Ghent-2DPAGE 126 96 <0.01
91 Siena-2DPAGE 102 102 <0.01
92 HSC-2DPAGE 85 85 <0.01
93 2DBase-Ecoli 84 84 <0.01
94 PhosSite 73 73 <0.01
95 Cornea-2DPAGE 67 67 <0.01
96 COMPLUYEAST-2DPAGE 59 59 <0.01
97 euHCVdb 55 44 <0.01
98 PMMA-2DPAGE 52 52 <0.01
99 PptaseDB 31 31 <0.01
100 Rat-heart-2DPAGE 28 28 <0.01
101 ANU-2DPAGE 22 22 <0.01
Number of explicitly cross-referenced databases: 102
Number of implicitly cross-referenced databases: 23
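The "Average per entry" column in the table above appears to be the total number of lines divided by the number of entries in the release, shown to two decimals, with values that would round to 0.00 shown as "<0.01". The sketch below works under that assumption and takes the release size to be 392667 entries, the sum of the kingdom counts in section 3.3; the function name is illustrative.

```python
# Assumption: release size taken as the sum of the kingdom counts in 3.3.
TOTAL_ENTRIES = 392667


def average_per_entry(total_lines, total_entries=TOTAL_ENTRIES):
    """Format total_lines / total_entries the way the table displays it."""
    text = f"{total_lines / total_entries:.2f}"
    # Averages that would print as 0.00 are shown as "<0.01" in the table.
    return "<0.01" if text == "0.00" else text


# Totals copied from the table above.
for label, total in [
    ("References (RL)", 716052),
    ("Comments (CC)", 1625064),
    ("Features (FT)", 2470799),
    ("Cross-references (DR)", 6953477),
    ("Book citation", 594),
]:
    print(f"{label:<24} {total:>8} {average_per_entry(total):>6}")
```

Under these assumptions the four section totals give 1.82, 4.14, 6.29 and 17.71, in agreement with the table.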
7. MISCELLANEOUS STATISTICS
Total number of distinct authors cited in UniProtKB/Swiss-Prot: 254724
Total number of entries encoded on a Mitochondrion: 4375
Total number of entries encoded on a Plasmid: 3430
Total number of entries encoded on a Plastid: 9853
Total number of entries encoded on a Plastid; Apicoplast: 16
Total number of entries encoded on a Plastid; Chloroplast: 9444
Total number of entries encoded on a Plastid; Cyanelle: 145
Total number of entries encoded on a Plastid; Non-photosynthetic plastid: 118
Number of fragments: 8097
Number of additional sequences produced by alternative splicing, initiation or promoter usage: 26284
| UniProtKB/TrEMBL protein database release 39.0 statistics |
|---|
1. INTRODUCTION
Release 39.0 of 22-Jul-2008 of UniProtKB/TrEMBL contains 6'070'085 sequence entries
comprising 624'149'168 amino acids.
815'041 sequences have been added since release 38, the sequence data of
6'451 existing entries has been updated and the annotations of
5'255'044 entries have been revised. The added sequences represent an
increase of 15% over release 38.
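The growth figure can be checked from the counts given in this introduction. The sketch below assumes the percentage compares the number of added sequences with the size of the previous release, and that the previous release size is the current entry count minus the added sequences; the variable names are illustrative.

```python
current_entries = 6_070_085   # entries in release 39.0, as stated above
added_entries = 815_041       # sequences added since release 38

# Assumption: no entries were removed, so the previous size is the difference.
previous_entries = current_entries - added_entries
growth_pct = 100 * added_entries / previous_entries

print(f"previous release: {previous_entries} entries")
print(f"growth: {growth_pct:.1f}%")
```

Under these assumptions the growth comes to roughly 15.5%, which is consistent with the stated 15% if the figure is truncated rather than rounded.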
2. AMINO ACID COMPOSITION
2.1 Composition in percent for the complete database
Ala (A) 8.57 Gln (Q) 3.89 Leu (L) 9.85 Ser (S) 6.77
Arg (R) 5.53 Glu (E) 6.06 Lys (K) 5.23 Thr (T) 5.60
Asn (N) 4.19 Gly (G) 7.07 Met (M) 2.42 Trp (W) 1.34
Asp (D) 5.26 His (H) 2.20 Phe (F) 4.04 Tyr (Y) 3.03
Cys (C) 1.33 Ile (I) 5.96 Pro (P) 4.81 Val (V) 6.66
Asx (B) 0.000 Glx (Z) 0.000 Xaa (X) 0.07
2.2 Classification of the amino acids by their frequency
Leu, Ala, Gly, Ser, Val, Glu, Ile, Thr, Arg, Asp, Lys, Pro, Asn, Phe,
Gln, Tyr, Met, His, Trp, Cys
3. TAXONOMIC ORIGIN
Total number of species represented in this release of UniProtKB/TrEMBL: 170489
The first twenty species represent 954002 sequences: 15.7 % of the
total number of entries.
3.1 Table of the frequency of occurrence of species
Species represented 1x: 78172
2x: 30985
3x: 16319
4x: 9281
5x: 5325
6x: 3971
7x: 2943
8x: 2395
9x: 1875
10x: 2217
11- 20x: 9774
21- 50x: 3492
51-100x: 1416
>100x: 2324
3.2 Table of the most represented species
------ --------- --------------------------------------------
Number Frequency Species
------ --------- --------------------------------------------
1 238675 Human immunodeficiency virus 1
2 95231 Oryza sativa subsp. japonica (Rice)
3 54861 Homo sapiens (Human)
4 54323 Vitis vinifera (Grape)
5 50188 Trichomonas vaginalis G3
6 44675 Mus musculus (Mouse)
7 44524 Arabidopsis thaliana (Mouse-ear cress)
8 42163 Hepatitis C virus
9 39808 Paramecium tetraurelia
10 39254 Oryza sativa subsp. indica (Rice)
11 35653 Physcomitrella patens subsp. patens
12 28243 Drosophila melanogaster (Fruit fly)
13 28067 Tetraodon nigroviridis (Green puffer)
14 27250 uncultured bacterium
15 24942 Danio rerio (Zebrafish) (Brachydanio rerio)
16 24842 Nematostella vectensis (Starlet sea anemone)
17 20534 Caenorhabditis elegans
18 20490 Trypanosoma cruzi
19 20180 Culex quinquefasciatus (Southern house mosquito)
20 20099 Hepatitis B virus (HBV)
21 19172 Caenorhabditis briggsae
22 17883 Laccaria bicolor (strain S238N-H82) (Bicoloured deceiver)
23 16803 Aedes aegypti (Yellowfever mosquito)
24 16685 Tetrahymena thermophila SB210
25 16302 Botryotinia fuckeliana (strain B05.10) (Noble rot fungus) (Botrytis cinerea)
26 15880 Phaeosphaeria nodorum (Septoria nodorum)
27 14718 Chlamydomonas reinhardtii
28 14679 Plasmodium chabaudi
29 14325 Sclerotinia sclerotiorum (strain ATCC 18683 / 1980 / Ss-1) (White mold)
30 14158 Anopheles gambiae (African malaria mosquito)
31 14036 Aspergillus niger
32 13492 Coprinopsis cinerea (strain Okayama-7 / 130 / FGSC 9003) (Inky cap fungus)
33 12757 Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea)
34 12419 Xenopus laevis (African clawed frog)
35 12062 Pyrenophora tritici-repentis Pt-1C-BFP
36 11941 Aspergillus oryzae
37 11788 Plasmodium berghei
38 11698 Dictyostelium discoideum (Slime mold)
39 11570 Brugia malayi (Filarial nematode worm)
40 10914 Chaetomium globosum (Soil fungus)
41 10714 Podospora anserina
42 10426 Neurospora crassa
43 10323 Coccidioides immitis
44 10318 Hepatitis C virus subtype 1b
45 10267 Aspergillus terreus (strain NIH 2624)
46 10262 Neosartorya fischeri (Aspergillus fischerianus
47 10040 Escherichia coli
48 9990 Drosophila pseudoobscura (Fruit fly)
49 9905 Aspergillus fumigatus (strain CEA10 / CBS 144.89 / FGSC A1163)
50 9896 Bos taurus (Bovine)
51 9834 Schistosoma japonicum (Blood fluke)
52 9799 Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
53 9673 Cryptococcus neoformans (Filobasidiella neoformans)
54 9650 Aspergillus fumigatus (Sartorya fumigata)
55 9469 Trypanosoma brucei
56 9456 Emericella nidulans (Aspergillus nidulans)
57 9287 Candida albicans (Yeast)
58 9227 Monosiga brevicollis (Choanoflagellate)
59 9203 Ajellomyces capsulata (strain NAm1 / WU24) (Darling's disease fungus)
60 9201 Sorangium cellulosum (strain So ce56) (Polyangium cellulosum (strain So ce56))
61 8983 Aspergillus clavatus
62 8826 Rhodococcus sp. (strain RHA1)
63 8781 Rattus norvegicus (Rat)
64 8607 Entamoeba dispar SAW760
65 8603 Methylobacterium nodulans ORS 2060
66 8513 Stigmatella aurantiaca DW4/3-1
67 8475 Simian immunodeficiency virus (isolate CPZ GAB1) (SIV-cpz)
68 8437 Plesiocystis pacifica SIR-1
69 8398 Helicobacter pylori (Campylobacter pylori)
70 8249 Microscilla marina ATCC 23134
71 8205 Burkholderia xenovorans (strain LB400)
72 8129 Bradyrhizobium japonicum
73 8027 Leishmania infantum
74 7970 Ostreococcus tauri
75 7935 Acaryochloris marina (strain MBIC 11017)
76 7887 Leishmania braziliensis
77 7810 Plasmodium yoelii yoelii
78 7642 Pseudomonas aeruginosa
79 7575 Solibacter usitatus (strain Ellin6076)
80 7514 Plasmodium vivax
81 7503 Streptomyces coelicolor
82 7501 Rhizobium leguminosarum bv. trifolii WSM1325
83 7463 Burkholderia phymatum (strain DSM 17167 / STM815)
84 7463 Plasmodium falciparum
85 7401 Ostreococcus lucimarinus (strain CCE9901)
86 7349 Burkholderia pseudomallei 305
87 7293 Bradyrhizobium sp. (strain BTAi1 / ATCC BAA-1182)
88 7292 Burkholderia sp. (strain 383) (Burkholderia cepacia
89 7274 Clostridium bolteae ATCC BAA-613
90 7267 Streptomyces avermitilis
91 7221 Burkholderia multivorans (strain ATCC 17616 / 249)
92 7197 Burkholderia phytofirmans (strain DSM 17436 / PsJN)
93 7136 Rhizobium loti (Mesorhizobium loti)
94 7132 Frankia sp. (strain EAN1pec)
95 7124 Burkholderia ambifaria MEX-5
96 7122 Leishmania major
97 7081 Burkholderia vietnamiensis (strain G4 / LMG 22486) (Burkholderia cepacia
98 7061 Myxococcus xanthus (strain DK 1622)
99 7005 Streptomyces griseus subsp. griseus (strain JCM 4626 / NBRC 13350)
100 6981 Burkholderia cenocepacia (strain MC0-3)
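The statement at the start of section 3 that the first twenty species account for 954002 sequences, or 15.7% of all entries, can be reproduced from the first twenty rows of the table above. A minimal sketch, with the frequencies copied from the table and the release total taken from the introduction (names are illustrative):

```python
# Frequencies of the twenty most represented species, in table order.
top20_frequencies = [
    238675, 95231, 54861, 54323, 50188,
    44675, 44524, 42163, 39808, 39254,
    35653, 28243, 28067, 27250, 24942,
    24842, 20534, 20490, 20180, 20099,
]

total_entries = 6_070_085  # UniProtKB/TrEMBL release 39.0 entry count

top20_total = sum(top20_frequencies)
top20_share = 100 * top20_total / total_entries

print(f"{top20_total} sequences, {top20_share:.1f}% of all entries")
```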
3.3 Taxonomic distribution of the sequences
Kingdom sequences (% of the database)
Archaea 117313 ( 2%)
Bacteria 3404071 ( 56%)
Eukaryota 1895580 ( 31%)
Viruses 648091 ( 11%)
Other 5029 ( <1%)
Within Eukaryota:
Category sequences (% of Eukaryota) (% of the complete database)
Human 54862 ( 3%) ( 1%)
Other Mammalia 135932 ( 7%) ( 2%)
Other Vertebrata 211928 ( 11%) ( 3%)
Viridiplantae 484409 ( 26%) ( 8%)
Fungi 361583 ( 19%) ( 6%)
Insecta 190201 ( 10%) ( 3%)
Nematoda 56736 ( 3%) ( 1%)
Other 399929 ( 21%) ( 7%)
4. SEQUENCE SIZE
Distribution of the sequences by size (excluding fragments)
From To Number From To Number
1- 50 208566 1001-1100 40261
51- 100 637056 1101-1200 27366
101- 150 747961 1201-1300 18801
151- 200 699159 1301-1400 12801
201- 250 674548 1401-1500 10254
251- 300 573450 1501-1600 7455
301- 350 535981 1601-1700 5842
351- 400 410656 1701-1800 4650
401- 450 347643 1801-1900 3538
451- 500 284281 1901-2000 3059
501- 550 190417 2001-2100 2455
551- 600 141809 2101-2200 2451
601- 650 103668 2201-2300 1886
651- 700 83899 2301-2400 1594
701- 750 71389 2401-2500 1362
751- 800 62475 >2500 11470
801- 850 46437
851- 900 42027
901- 950 29461
951-1000 23945
The average sequence length in UniProtKB/TrEMBL is 322 amino acids.
The shortest sequence is Q16047_HUMAN: 4 amino acids.
The longest sequence is Q3ASY8_CHLCH: 36805 amino acids.
5. STATISTICS FOR SOME LINE TYPES
The following table summarizes the total number of some UniProtKB/TrEMBL lines,
as well as the number of entries with at least one such line, and the
frequency of the lines.
Total Number of Average
Line type / subtype number entries per entry
--------------------------------- -------- --------- ---------
References (RL) 7640409 1.26
Submitted to EMBL/GenBank/DDBJ 4154948 3514648 0.68
Journal 3352515 3093695 0.55
Thesis 6880 6824 <0.01
Book citation 4356 4312 <0.01
Submitted to other databases 3480 3473 <0.01
Other 118230 116766 0.02
Comments (CC) 4393499 0.72
SIMILARITY 1358723 1235686 0.22
CAUTION 1317857 1317857 0.22
CATALYTIC ACTIVITY 449239 383227 0.07
FUNCTION 442937 425981 0.07
SUBCELLULAR LOCATION 362423 362395 0.06
PATHWAY 163108 149477 0.03
SUBUNIT 149694 148693 0.02
COFACTOR 138898 136672 0.02
MISCELLANEOUS 5726 5726 <0.01
INTERACTION 4295 4295 <0.01
DOMAIN 599 599 <0.01
Features (FT) 2546720 0.42
NON_TER 2098331 1246419 0.35
CHAIN 276320 220896 0.05
SIGNAL 171508 171508 0.03
TRANSIT 561 561 <0.01
Cross-references (DR) 56036818 9.23
GO 11116789 3570918 1.83
InterPro 9220668 4167387 1.52
EMBL 6851513 6062637 1.13
Pfam 5198497 3844857 0.86
RefSeq 2971744 2875731 0.49
GeneID 2957505 2869408 0.49
PROSITE 2842142 1869467 0.47
KEGG 1860398 1795530 0.31
Gene3D 1757519 1506810 0.29
GenomeReviews 1596334 1546434 0.26
PRINTS 1089660 917737 0.18
HOGENOM 1061239 1061235 0.17
SMART 1008925 792054 0.17
NMPDR 957022 957011 0.16
TIGRFAMs 939056 858853 0.15
PANTHER 905430 859383 0.15
ProDom 700180 668524 0.12
SMR 494328 494243 0.08
HOVERGEN 316989 316798 0.05
BioCyc 304168 291467 0.05
UniGene 275255 251204 0.05
PIRSF 262205 262205 0.04
HSSP 261663 261371 0.04
TIGR 198869 191592 0.03
PIR 182023 149002 0.03
Ensembl 157982 151190 0.03
ArrayExpress 100469 100437 0.02
Gramene 69959 69959 0.01
euHCVdb 47728 47728 0.01
MGI 40387 40202 0.01
FlyBase 34972 34832 0.01
HGNC 29172 29143 <0.01
VectorBase 29057 28725 <0.01
MEROPS 26390 25729 <0.01
TAIR 19447 19396 <0.01
WormPep 19423 19320 <0.01
WormBase 19414 19320 <0.01
ZFIN 16063 16056 <0.01
LinkHub 12019 12019 <0.01
dictyBase 10181 10179 <0.01
CGD 6987 6987 <0.01
RGD 5924 4066 <0.01
PDBsum 5860 3334 <0.01
PDB 5860 3334 <0.01
IntAct 5461 5460 <0.01
LegioList 5399 5369 <0.01
ListiList 4684 4667 <0.01
PseudoCAP 4390 4387 <0.01
PhotoList 3988 3864 <0.01
BuruList 3976 3942 <0.01
AGD 3925 3925 <0.01
REBASE 3685 3660 <0.01
TubercuList 2517 2511 <0.01
DIP 2276 2271 <0.01
PeroxiBase 2082 2076 <0.01
SagaList 1721 1627 <0.01
PhosphoSite 1404 1404 <0.01
Leproma 957 956 <0.01
MypuList 584 580 <0.01
GeneDB_Spombe 515 510 <0.01
ProMEX 483 483 <0.01
World-2DPAGE 418 418 <0.01
SGD 327 327 <0.01
PeptideAtlas 194 194 <0.01
PharmGKB 121 121 <0.01
PHCI-2DPAGE 103 103 <0.01
Reactome 67 62 <0.01
ANU-2DPAGE 59 59 <0.01
SWISS-2DPAGE 29 29 <0.01
REPRODUCTION-2DPAGE 16 16 <0.01
CYGD 16 16 <0.01
PMMA-2DPAGE 3 3 <0.01
Siena-2DPAGE 2 2 <0.01
COMPLUYEAST-2DPAGE 1 1 <0.01
Number of explicitly cross-referenced databases: 102
6. MISCELLANEOUS STATISTICS
Total number of distinct authors cited in UniProtKB/TrEMBL: 266582
Total number of entries encoded on a Mitochondrion: 213912
Total number of entries encoded on a Plasmid: 97022
Total number of entries encoded on a Plastid: 4959
Total number of entries encoded on a Plastid; Apicoplast: 264
Total number of entries encoded on a Plastid; Chloroplast: 74165
Total number of entries encoded on a Plastid; Cyanelle: 7
Total number of entries encoded on a Plastid; Non-photosynthetic plastid: 237
Number of fragments: 1245645
| Submissions and Updates |
|---|
We welcome feedback from our users. We would especially appreciate being notified if sequences belonging to your field of expertise are missing from the database. We would also like to hear about annotations that need updating, for example when the function of a protein has been clarified or when new information about post-translational modifications has become available.
Submit new sequence data, updates and corrections at http://www.uniprot.org/support/submissions.shtml
For all queries regarding submissions to UniProtKB and to submit new protein sequence data, please contact:
UniProt Knowledgebase
The EMBL Outstation - The European Bioinformatics Institute
Wellcome Trust Genome Campus
Hinxton
Cambridge CB10 1SD
United Kingdom
Telephone: (+44 1223) 494 462
Telefax: (+44 1223) 494 468
| Download information |
|---|
The latest data of the UniProt Knowledgebase is available in various formats (flatfile, XML or FASTA) at http://www.uniprot.org/database/download.shtml. The data is further supplemented by a file containing the sequences of all additional alternative isoforms annotated in UniProtKB/Swiss-Prot. This data set is documented in the file ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/complete/README.varsplic
For users who wish to download the UniProt Knowledgebase only occasionally, we distribute the latest major release (updated 3 times per year) in flatfile format. Previous UniProtKB/Swiss-Prot and UniProtKB/TrEMBL releases are archived under ftp://ftp.uniprot.org/pub/databases/uniprot/previous_major_releases. The UniProt Knowledgebase major release is also available on DVD from the EBI.
| Citation |
|---|
If you want to cite UniProt in a publication, please use the following reference:
The UniProt Consortium
"The Universal Protein Resource (UniProt)"
Nucleic Acids Res. 36:D190-D195(2008) doi:10.1093/nar/gkm895