Table 1

Basic sequence characteristics of the Anopheles gambiae genes isolated during this study and their orthologs from Aedes aegypti and Culex quinquefasciatus.

Species

Length

(nt)

5'UTR (nt)

Coding region

(nt)

3'UTR

(nt)

Protein

length

(aa)

Identity//similarity1

(%)

Gene status2


AgRopn1l

An. gambiae

2391–25063

29–1443

2040

319

679

unannotated

Ae. aegypti

2166

721

54.2/65.3

14993

Cx. quinquefasciatus

2091

696

74.7/83.8

07290

AgDzip1l

An. gambiae

2623–27073

76–1603

2547

48

848

01165

Ae. aegypti

2502

833

46.5/58.1

unannotated

Cx. quinquefasciatus

2328

775

63.5/74.2

11569

Ams

An. gambiae

1356

39

1227

90

408

unannotated

Ae. aegypti

nd4

nd

13.8/30.85

unannotated

Cx. quinquefasciatus

nd

nd

70.8/89.95

unannotated

mts

An. gambiae

1122

108

897

117

295

unannotated

Ae. aegypti

1102

346

25.1/36.1

09638

Cx. quinquefasciatus

912

303

45.8/54.9

00571

AAms

An. gambiae

3724

87

3567

157

1188

unannotated

Ae. aegypti

4707

1568

22.6/38.5

unannotated

Cx. quinquefasciatus

5022

1673

36.1/50.4

unannotated


1 Protein sequence identity/similarity calculated after exclusion of gapped regions. Values for the An. gambiae/Ae. aegypti comparisons are given in the rows corresponding to Aedes aegypti; these values are also representative of the identity/similarity between An. gambiae and Cx. quinquefasciatus. The identity/similarity between Ae. aegypti and Cx. quinquefasciatus proteins is given in the Cx. quinquefasciatus rows.

2 The Ensembl gene ID (abridged by omitting the species codes AGAP0, AAEL0, and CPIJ0 for An. gambiae, Ae. aegypti, and Cx. quinquefasciatus; respectively) is given for genes with complete (bold) or partially (italic) predicted sequence in the recent VectorBase gene builds (An. gambiae: AgamP3.4; Ae. aegypti: AaegL1.1; Cx. quinquefasciatus: CpipJ1.0_5); genes that do not match any gene predictions are marked as unannotated.

3 Several alternative transcription start sites identified.

4 nd – not determined

5 Data based on a portion of a gene sequence; see text for details.

Krzywinska and Krzywinski BMC Genomics 2009 10:300   doi:10.1186/1471-2164-10-300

Open Data