Table 2

ORFs in Majority-annotated mixed COGs of stringency 6 that may represent missed genes

ORF COG ida
Organism
Genomic coordinatesb
Annotated gene(s) present in COGc
ORF COG ida
Organism
Genomic coordinatesb
Annotated gene(s) present in COGc


Potential genes missed in current annotations
Potential genes missed in current annotations (continued)


678
Bbur
117772-116825
cdsA
397
Nmen
340008-339358
coaE
314
Bhal
1503738-1503905
rpmG
871
Nmen
554238-552676
mucD/deg
314
Bsub
2477091-2476963
rpmG
723
Nmen
666433-665363
potA/cysA/malK
2346
Bsub
4202360-4202148

119
Nmen
690163-687386
trkH
1717
Cace
243535-242696
alx
1382
Nmen
1056138-1057340
hflX
1908
Cace
1395172-1395522
minE
464
Nmen
1147918-1149261
tilS
2064
Cace
2284461-2283778

464
Nmen
1179954-1181297
tilS
148
Cace
3287735-3286509
tufA
2743
Nmen
1400226-1401977

1840
Cace
3650828-3649308

978
Nmen
1484110-1486353
dnaX
659
Cace
3842459-3840768
plpB
635
Nmen
1527781-1528521

1551
EcoK12
311756-311598
rpmJ
1248
Nmen
1629570-1628017
pepA
148
EcoK12
3469408-3468167
tufA
2793
Nmen
1749455-1752016
gcvP
1551
EcoO157
344941-344783
rpmJ
618
Nmen
2119341-2120882
hrpB
2748
EcoO157
4240898-4240665

618
Nmen
2124720-2128169
hrpB
2531
Hinf
131970-132959
mltA
788
Nmen
2199859-2200686
folD
2319
Hinf
170676-169396
dcuB
2519
Paer
224101-225219
ald
2432
Hinf
235913-238519

1385
Paer
434829-433933

2947
Hinf
370735-372912

38
Paer
4143744-4142569
prfA
1098
Hpyl
315887-316504
dppC
2748
Sent
4247574-4247864

309
Lmon
640139-639558
bioY
192
Tpal
213049-213270
rpmD
2023
Mgen
180733-181020

653
Tpal
624206-625738
ptsP
994
Mmob
102995-102588
nusB
890
Tpal
946250-944889
comM
3131
Mmob
201807-201646
rpmG
946
Tpal
1032059-1031772

3175
Mmob
317659-317411
secG
39
Upar
3002-3886
hemK
3186
Mmob
449811-451241

142
Upar
3861-4427

3000
Mmyc
441031-441783

3131
Upar
725869-726024
rpmG
542
Mmyc
441031-441783

38
VchoI
709524-710558
prfA
199
Mmyc
830915-830742
rpmI
2932
VchoI
1045279-1044317

73
Mmyc
831148-830924
infA
2947
VchoI
1627856-1625871

182
Mmyc
836915-836712
rpsN
1246
VchoI
2869620-2871836
pulA/glgX
3175
Mmyc
973088-973423
secG
2793
VchoII
295059-292882
gcvP
3131
Mmyc
1089962-1090141
rpmG
2621
VchoII
299032-300000
gcvT
314
Mmyc
1089962-1090141
rpmG
2699
VchoII
406033-405167
sbp
1670
Mpen
2755-3009

2573
VchoII
987698-986424
aroF/aroG/aroH
3131
Mpen
1191375-1191163
rpmG
2340
VchoII
1026697-1023563
dhaS/aldA
879
Mpen
1226934-1226722
rpmI




199
Mpen
1317088-1316960
rpmI
Gene annotated in different framed
166
Mpen
1327926-1326898
rplV
1769
Bhal
251734-251429
nrdG
2023
Mpne
207436-207717

3183
Mpul
130854-130480

2090
Nmen
70930-70358

3175
Mpul
412829-413074
secG
148
Nmen
149590-150777
tufA
946
Rpro
433751-433479

2564
Nmen
238562-237666

363
Tpal
262583-262897
rpsT
2572
Nmen
299359-298070
phr





aThe identifiers for COGs are local to this study. They do not correspond to numbers in the NCBI COG database.

bCoordinates in which the first number is greater than the second indicate that the ORF is on the minus strand.

cA named annotated putative ortholog in another organism or paralog within the organism to the ORF listed.

dThese COGs may indicate both that the ORF listed is a missed gene and that the annotated

Powell and Hutchison BMC Bioinformatics 2006 7:31   doi:10.1186/1471-2105-7-31