Updated March 10, 2006 by JW.

 

CESA proteins are at the end.

TIGR Rice Community Annotation spreadsheet is at the very end

 

General references: Hazen et al. (2002) Plant Physiol. 128:336 ; Keegstra and Walton (2006) Science, in press.

This web site address: /CSL_updates.htm

 

Please send corrections to walton@msu.edu

 

 

CSLA summary

 

 

Gene

GenBank BAC/WGS

cDNA/EST (incomplete)

introns

protein

Notes

OsCslA1

AP000366

(Xsome 2)   

 

8

521 aa

Richmond also calls this OsCslA1 but several Genbank entries call it OsCslA9 or OsCslA9-like.

OsCslA2

AC021893

(Xsome 10)

D22177

AF435640

8

580

 

OsCslA3

AP003509

AACV01013359

(Xsome 6)

 

8

551

 

OsCslA4

AC073556

AAL84294

(Xsome 3)

 

8

549

 

OsCslA5

AC084766

XM_470723

(Xsome 3)

 

 

10

574

 

OsCslA6

XM_467756*
AP005297* 

(Xsome 2)

AA749881

AF435648

CB645479

CB644141

CB684314

8

574

 

OsCslA7

AP004260

(Xsome 7)

C71923

AF435643

CB672774
CX118263

CB651046

9

585

 

OsCslA9

AP008212
AP004737

(Xsome 6)

CR280231
CX104183
 

8

527

AF432499

OsCslA11

AP004666

AP005757

XM_482559

(Xsome 8)

CF964708 
BX929094
CK061630 

 

8

570

 

The positions of eight introns are conserved in all of them. In addition, OsCslA5 is predicted to have two more introns near its 5’ end, and A7 to have one more intron in its 5’ end. The additional intron in A7 is confirmed by at least EST.

 

OsCslA4 has an atypical intron start of GC instead of GT near the 3’ end.

 

*OsCslA6: The predicted aa sequences of XM_470723 and AP005297 have 3 additional amino acids at first intron junction, which I think is wrong based on alignment with other OsCslA’s. Examination of available ESTs confirms that “TMQ” don’t belong in there.

 

 

>OsCslA1  521 aa

MEVNGGGAAGLPEAWSQVRAPVIVPLLRLAVAVCLTMSVLLFLERMYMAVVISGVKILRRRPDRRYRCDPIPDDDPELGTSAFPVVLIQIPMFNEREVYQLSIGAVCGLSWPSDRLVVQVLDDSTDPVIKEMVRIECERWAHKGVNITYQIRENRKGYKAGALKEGMKHGYVRECEYVAIFDADFQPDPDFLRRTIPFLVHNSDIALVQARWRFVNADECLMTRMQEMSLDYHFTVEQEVSSSVCAFFGFNGTAGVWRVSAVNEAGGWKDRTTVEDMDLAIRASLKGWKFVYLGDVQVKSELPSTFKAFRFQQHRWSCGPANLFRKMLMEIVRNKKVTIWKKIHVIYNFFLIRKIIAHIVTFAFYCLIIPATIFVPEVRIPKWGCVYIPTIITLLNSVGTPRSFHLLFFWILFENVMSLHRTKATLIGLLEAGRANEWVVTEKLGNALKMKSSSKSSAKKSFMRVWDRLNVTELGVAAFLFSCGWYDLAFGKDHFFIYLFFQGAAFFIVGIGYVGTIVPQS

 

>OsCslA2  580 aa [this one was difficult]

MSTNGGAPSQKRSWLPSRPLLTTTTQTYPPPLLPFKKLHAPPTAARRSLPPAASKPMASSSSSSLPAAWAAAVRAWAVAPALRAAVWACLAMSAMLVAEAAWMGLASLAAAAARRLRGYGYRWEPMAAPPDVEAPAPAPAEFPMVLVQIPMYNEKEVYKLSIGAACALTWPPDRIIIQVLDDSTDPFVKELVELECKEWASKKINIKYEVRNNRKGYKAGALRKGMEHTYAQLCDFVAIFDADFEPESDFLLKTMPYLLHNPKIALVQTRWEFVNYNVCLMTRIQKMSLDYHFKVEQESGSFMHAFFGFNGTAGVWRVSAINQSGGWKDRTTVEDMDLAVRASLKGWEFLYVGDIRVKSELPSTFQAYRHQQHRWTCGAANLFRKMAWEIITNKEVSMWKKYHLLYSFFFVRRAIAPILTFLFYCIVIPLSAMVPEVTIPVWGLVYIPTAITIMNAIRNPGSVHLMPFWILFENVMAMHRMRAALSGLLETARANDWVVTEKVGDQVKDELDVPLLEPLKPTECAERIYIPELLLALYLLICASYDFVLGNHKYYIYIYLQAVAFTVMGFGFVGTRTPCS

 

>OsCslA3  551 aa

MAMAGADGPTAGAAAAVRWRGGESLLLLLLRWPSSAELVAAWGAARASAVAPALAAASAACLALSAMLLADAVLMAAACFARRRPDRRYRATPLGAGAGADDDDDDEEAGRVAYPMVLVQIPMYNEREVYKLSIGAACGLSWPSDRLIVQVLDDSTDPTVKGLVELECKSWGNKGKNVKYEVRNTRKGYKAGALKEGLLRDYVQQCNYVAIFDADFQPEPDFLLRTIPYLVRNPQIGLVQAHWEFVNTSECLMTRIQKMTLHYHFKVEQEGGSSTFAFFGFNGTAGVWRISALEEAGGWKDRTTVEDMDLAVRAGLKGWKFVYLADVKVKSELPSNLKTYRHQQHRWTCGAANLFRKVGAEILFTKEVPFWWKFYLLYSFFFVRKVVAHVVPFMLYCVVIPFSVLIPEVTVPVWGVVYVPTTITLLHAIRNTSSIHFIPFWILFENVMSFHRTKAMFIGLLELGGVNEWVVTEKLGNGSNTKPASQILERPPCRFWDRWTMSEILFSIFLFFCATYNLAYGGDYYFVYIYLQAIAFLVVGIGFCGTISSNS

 

>OsCslA4. 549 aa.

MEGQWGRWRLAAAAAASSSGDQIAAAWAVVRARAVAPVLQFAVWACMAMSVMLVLEVAYMSLVSLVAVKLLRRVPERRYKWEPITTGSGGVGGGDGEDEEAATGGREAAAFPMVLVQIPMYNEKEVYKLSIGAACALTWPPDRIIIQVLDDSTDPAIKDLVELECKDWARKEINIKYEIRDNRKGYKAGALKKGMEHIYTQQCDFVAIFDADFQPESDFLLKTIPFLVHNPKIGLVQTRWEFVNYDVCLMTRIQKMSLDYHFKVEQESGSSMHSFFGFNGTAGVWRVSAINEAGGWKDRTTVEDMDLAVRASLKGWQFLYVGDIRVKSELPSTFKAYRHQQHRWTCGAANLFRKMATEIAKNKGVSVWKKLHLLYSFFFVRRVVAPILTFLFYCVVIPLSVMVPEVSIPVWGMVYIPTAITIMNAIRNPGSIHLMPFWILFENVMAMHRMRAALTGLLETMNVNQWVVTEKVGDHVKDKLEVPLLEPLKPTDCVERIYIPELMVAFYLLVCASYDLVLGAKHYYLYIYLQAFAFIALGFGFAGTSTPCS

 

>OsCslA5   574 aa.

MEAGEAAGAVLFLLAAAVSLLAAVSTGALDFTYLVTVVGEGSSTSPGSGGGAWWREAWVGARSRAVAPALQVGVWACMVMSVMLVVEATYNSAVSVAARLVGWRPERWFKWEPLGGGAGAGDEEKGEAAAAAYPMVMVQIPMYNELEVYKLSIGAVCGLKWPKERLIIQVLDDSTDAFIKNLVELECEDWASKGLNIKYATRSGRKGFKAGALKKGMEWDYAKQCEYVAIFDADFQPEPDFLLRTVPFLMHNQNVALVQARWVFVNDRVSLLTRIQKTFLDYHFKAEQEAGSATFAFFSFNGTAGVWRTEAINDAGGWKDRTTVEDMDLAVRATLKGWKFIYLGDLRVKSELPSTYKAYCRQQFRWSCGGANLFRKMIWDVLVAKKVSSLKKIYILYSFFLVRRVVAPAVAFILYNVIIPVSVMIPELFLPIWGVAYIPTALLIVTAIRNPENLHTVPLWILFESVMSMHRLRAAVAGLLQLQEFNQWIVTKKVGNNAFDENNETPLLQKSRKRLINRVNLPEIGLSVFLIFCASYNLVFHGKNSFYINLYLQGLAFFLLGLNCVGTLPDHCCF

 

>OsCslA6   574 aa.

MQGSSTSILHFVPSDPTSTSVLDFLSPTPRGTSPVHDRRLHAGDLALRAGGDRLLVADTVAAVVESLVQAWRQVRMELLVPLLRGAVVACMVMSVIVLAEKVFLGVVSAVVKLLRRRPARLYRCDPVVVEDDDEAGRASFPMVLVQIPMYNEKEVYQLSIGAACRLTWPADRLIVQVLDDSTDAIVKELVRKECERWGKKGINVKYETRKDRAGYKAGNLREGMRRGYVQGCEFVAMLDADFQPPPDFLLKTVPFLVHNPRLALVQTRWEFVNANDCLLTRMQEMSMDYHFKVEQEAGSSLCNFFGYNGTAGVWRRQVIDESGGWEDRTTAEDMDLALRAGLLGWEFVYVGSIKVKSELPSTLKAYRSQQHRWSCGPALLFKKMFWEILAAKKVSFWKKLYMTYDFFIARRIISTFFTFFFFSVLLPMKVFFPEVQIPLWELILIPTAIILLHSVGTPRSIHLIILWFLFENVMALHRLKATLIGFFEAGRANEWIVTQKLGNIQKLKSIVRVTKNCRFKDRFHCLELFIGGFLLTSACYDYLYRDDIFYIFLLSQSIIYFAIGFEFMGVSVSS

 

>OsCslA7   585

MVEAGEIGGAAVFALAAAAALSAASSLGAVDFRRPLAAVGGGGAFEWDGVVPWLIGVLGGGDEAAAGGVSVGVAAWYEVWVRVRGGVIAPTLQVAVWVCMVMSVMLVVEATFNSAVSLGVKAIGWRPEWRFKWEPLAGADEEKGRGEYPMVMVQIPMYNELEVYKLSIGAACELKWPKDKLIVQVLDDSTDPFIKNLVELECESWASKGVNIKYVTRSSRKGFKAGALKKGMECDYTKQCEYIAIFDADFQPEPNFLLRTVPFLMHNPNVALVQARWAFVNDTTSLLTRVQKMFFDYHFKVEQEAGSATFAFFSFNGTAGVWRTTAINEAGGWKDRTTVEDMDLAVRASLNGWKFIYVGDIRVKSELPSTYGAYCRQQFRWACGGANLFRKIAMDVLVAKDISLLKKFYMLYSFFLVRRVVAPMVACVLYNIIVPLSVMIPELFIPIWGVAYIPMALLIITTIRNPRNLHIMPFWILFESVMTVLRMRAALTGLMELSGFNKWTVTKKIGSSVEDTQVPLLPKTRKRLRDRINLPEIGFSVFLIFCASYNLIFHGKTSYYFNLYLQGLAFLLLGFNFTGNFACCQ

 

>OsCslA9   527

MAAAGAVLPEQIAAMWEQVKAPVVVPLLRLSVAACLAMSVMLFVEKVYMSVVLVGVHLFGRRPDRRYRCDPIVAAGADNDDPELADANAAFPMVLIQIPMYNEREVYKLSIGAACGLSWPSDRVIVQVLDDSTDPVIKEMVQVECKRWESKGVRIKYEIRDNRVGYKAGALREGMKHGYVRDCDYVAIFDADFQPDPDFLARTIPFLVHNPDIALVQARWKFVNANECLMTRMQEMSLDYHFKVEQEVGSSTHAFFGFNGTAGVWRISAMNEAGGWKDRTTVEDMDLAVRAGLKGWKFVYLGDLMVKSELPSTFKAFRYQQHRWSCGPANLFRKMLVEIATNKKVTLWKKIYVIYNFFLVRKIIGHIVTFVFYCLVVPATVLIPEVEIPRWGYVYLPSIVTILNSIGTPRSLHLLIFWVLFENVMSLHRTKATLIGLLETGRVNEWVVTEKLGDALKLKLPGKAFRRPRMRIGDRVNALELGFSAYLSFCGCYDIAYGKGYYSLFLFLQSITFFIIGVGYVGTIVPH

 
>CslA11. BAC AP004666, 8 introns. 570 aa.

MSSSGGGGVAEEVARLWGELPVRVVWAAVAAQWAAAAAAARAAVVVPPVRALVAVSLAMTVMILAEKLFVAAVCLAVRAFRLRPDRRYKWLPIGAAAAAASSEDDEESGLVAAAAAFPMVLVQIPMFNEREVYKLSIGAACSLDWPSDRVVIQVLDDSTDLVVKDLVEKECQKWQGKGVNIKYEVRGNRKGYKAGALKEGLKHDYVKECEYIAMFDADFQPESDFLLRTVPFLVHNSEIALVQTRWKFVNANECLLTRFQEMSLDYHFKYEQEAGSSVYSFFGFNGTAGVWRIAAIDDAGGWKDRTTVEDMDLAVRATLQGWKFVYVGDVKVKSELPSTFKAYRFQQHRWSCGPANLFKKMMVEILENKKVSFWNKIHLWYDFFFVGKIAAHTVTFIYYCFVIPVSVWLPEIEIPLWGVVYVPTVITLCKAVGTPSSFHLVILWVLFENVMSLHRIKAAVTGILEAGRVNEWVVTEKLGDANKTKPDTNGSDAVKVIDVELTTPLIPKLKKRRTRFWDKYHYSEIFVGICIILSGFYDVLYAKKGYYIFLFIQGLAFLIVGFDYIGVCPP

 
 
 
 

OsCslC summary

 

gene

GenBank

ESTcDNA

introns

protein

Notes

OsCslC1

AP003377

AP008207

(Xsome 1)

                

 

4

690

BK000086

OsCslC2

AP005886
AP005568

(Xsome 9)

AI978402

AF435650
(1.8 kb)

4

698

 

OsCslC3

AP004013

(Xsome 8)

 

4

745

 

OsCslC4

AC122144

AC108884

 pseudogene

 

xxxxx

 

OsCslC5

AC034258

 pseudogene

 

xxxxx

 

OsCslC6

AC098694

AC078891

(Xsome 10)

 

pseudogene

 

xxxxx

 

OsCslC7

AC108873

AP008211

(Xsome 5)

C74862

AF435642

(1.1)

AAL38527
XM_475689

 

4

688

 

OsCslC8

AC083751

(Xsome 2)

 pseudogene

 

xxxxx

 

OsCslC9

AC133450

(Xsome 3)

  
  
  
  
AAAA02011089 (aligns perfectly
 with 8085)
DP000009 – no extra Ala
AP008209 
no extra Ala
 

 

AU068180
CK051466

AF435641
(2.0)(lacks the Ala)

CK008085 (has Ala) 
CK056494 (lacks Ala)
AK121805
(has the Ala) 
AF435652S1
 (lacks the Ala)
 
 

 

4

595 or 596

 

AAT85054 (has AAA); AF435641 (has AA)

 

Our genome sequence
 has one less codon
(an Ala near the N
 terminus) than some 
entries in GenBank: 
MAPWSGLWGGKLAAGESP vs.
 MAPWSGLWGGKLAAAGESP 
There are ESTs for 
both forms. The extra 
Ala is due to 
insertion of 
nucleotides CCG. Based on the fact that all genomic sequences lack the extra Ala, and that 2 other CslC’s (1  and 10) have only two 
Ala in this location 
(both have VAAG vs. 
LAAG/LAAAG for C9), I 
think that 595 is the 
correct length, not 
596. Or else it’s a 
natural variant. 

 

OsCsl10

AP005309

AP008213

(Xsome 7)

 

4

686

yes

 

Intron borders are conserved.

 

>OsCslC1
MARWWGGEGRGGSGTPVVVKMESPEWAISEVEAGAAAPGSPAAGGKAGRGKNARQITWVLLLKAHRAAGKLTGAASAALSVAAAARRRVAAGRTDSDDAAAAPPGESPALRARFHGFLRAFLLLSVLLLAVDVAAHAQGWHAVVPDLLAVEGLFAAAYASWLRVRLEYLAPGLQFLANACVVLFLIQSADRLILCLGCLWIKLKGIKPVPKASGGGGGGKGSDDVEAGADEFPMVLVQIPMCNEKEVYQQSIGAVCNLDWPRSNFLVQVLDDSDDAATSALIKEEVEKWQREGVRILYRHRVIRDGYKAGNLKSAMNCSYVKDYEFVVIFDADFQPQADFLKRTVPHFKGNEDVGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVFLNFFGFNGTAGVWRIKALEDSGGWMERTTVEDMDIAVRAHLKGWKFLYINDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCFVDIIKSKIGVWKKFNLIFLFFLLRKLILPFYSFTLFCIILPMTMFVPEAELPAWVVCYIPATMSLLNILPAPKSFPFIVPYLLFENTMSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLVSLVEKQPKQQRVGSAPNLDSLAKESHPKKDSKKKKHNRIYQKELALSFLLLTAAARSLLSVQGIHFYFLLFQGVSFLVVGLDLIGEQVE

 

>OsCslC2
MAPPGVGVGVAYLWGKGRGGRKGTPVVVTMESPNYSVVEVDGPDAEAELRTAAVAMDKGGGRGRSRSRTARQLTWVLLLRARRAAGRLASFAAAAARRFRRSPADAADELGRGRGRLMYGFIRGFLALSLLALAVELAAYWNGWRLRRPELHVPEAVEIEGWAHSAYISWMSFRADYIRRPIEFLSKACILLFVIQSMDRLVLCLGCFWIKLRKIKPRIEGDPFREGSGYQHPMVLVQIPMCNEKEVYEQSISAACQLDWPREKFLIQVLDDSSDESIQLLIKAEVSKWSHQGVNIVYRHRVLRTGYKAGNLKSAMSCDYVKDYEFVAIFDADFQPTPDFLKKTIPHFEGNPELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVFLNFFGFNGTAGVWRIQALEESGGWLERTTVEDMDIAVRAHLNGWKFIFLNDVKVLCELPESYEAYRKQQHRWHSGPMHLFWLCLPDILTAKISSWKKANLILLFFLLRKLILPFYSFTLFCVILPLTMFVPEAELPVWVICYVPVCMSFLNILPSPRSFPFIVPYLLFENTMSVTKFNAMVSGLFKLGSSYEWIVTKKSGRSSESDLSTAVERDTKDLTLPRLQKQISESELIDLKMQKERQEKAPLGAKKANKIYKKELALSLLLLTAATRSLLSAQGIHFYFLLFQGVSFLFVGLDLIGEQID

 

>OsCslC3
MAPPPNTYSESWWGGKEERGTPVVVKMDNPYSLVEIDGPGMAAPSEKARGKNAKQLTWVLLLRAHRAVGCVAWLAAGFWAVLGAVNRRVRRSRDADAEPDAEASGRGRAMLRFLRGFLLLSLAMLAFETVAHLKGWHFPRSAAGLPEKYLRRLPEHLQHLPEHLRRHLPEHLRMPEKEEIEGWLHRAYVAWLAFRIDYIAWAIQKLSGFCIALFMVQSVDRLVLCLGCFWIKLRGIKPVADTSISNDDIEATAGDGGGYFPMVLIQMPMCNEKEVYETSISHVCQIDWPRERMLVQVLDDSDDETCQMLIKAEVTKWSQRGVNIIYRHRLNRTGYKAGNLKSAMSCDYVRDYEFVAIFDADFQPNPDFLKLTVPHFKGNPELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVYLSFFGFNGTAGVWRIKALEDSGGWMERTTVEDMDIAVRAHLNGWKFIFLNDVKVLCELPESYQAYRKQQHRWHSGPMQLFRLCLPAVFKSKISTWKKANLVMLFFLLRKLILPFYSFTLFCVILPLTMFVPEAELPIWVICYVPVIMSVLNILPAPKSFPFVIPYLLFENTMSVTKFNAMVSGLFQLGSSYEWVVTKKAGRTSSESDILALAEAADADARPPPAKLHRGVSEGGLKEWAKLHKEQEDATAAAAAAAAPGTPVKKSKAAKAPNRIFKKELALAFLLLTAATRSLLSAQGLHFYFLLFQGVTFLAVGLDLIGEQvs

 

>OsCslC7
MAPSWWGMDGRKRHRPLTGGEAVDKKESATGVPTRWWCSREVGTGFAAGDGKTGPAKRRQIKWVLMLKAHRAAGRLTGAASAALAVASAARRRVASGRTDADAAPGESTALRARSYGCIRVSLVLSLLLLAVEVAAYLQGWHLEEVASLLAVDGLFAASYAGWMRLRLDYLAPPLQFLTNACVALFMVQSIDRLVLCLGCFWIRFKGIKPVPQAAAAGKPDVEAGAGDYPMVLVQMPMCNEREVYQQSIGAVCNLDWPKSNFLVQVLDDSDDATTSALIKEEVEKWQREGVRIIYRHRVIRDGYKAGNLKSAMNCSYVKDYEFVVIFDADFQPQADFLKRTVPHFKGKDDVGLVQARWSFVNKDENLLTRLQNVNLCFHFEVEQQVNGAFLNFFGFNGTAGVWRIKALEDSGGWMERTTVEDMDIAVRAHLKGWKFVFLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCFVDIIKSKIGFWKKFNLIFLFFLLRKLILPFYSFTLFCVILPMTMFVPEAELPAWVVCYIPATMSILNILPAPKSFPFIVPYLLFENTMSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLVGLVEKHSKQQRVGSAPNLDALTKEESNPKKDSKKKKHNRIYRKELALSFLLLTAAARSLLSAQGIHFYFLLFQGVSFLVVGLDLIGEQVE

 

>OsCslC9. 596 aa or 595 aa.
MAPWSGLWGGKLAAAGESPVLRSRFYAFIRAFVVLSVLLLIVELGAYINGWDDLAASALALPVIGVESLYASWLRFRATYVAPFIQFLTDACVVLFLIQSADRLIQCLGCFYIHLKRIKPNPKSPALPDAEDPDAAYYPMVLVQIPMCNEKEVYQQSIAAVCNLDWPRSNFLVQVLDDSDDPTTQTLIREEVLKWQQNGARIVYRHRVLRDGYKAGNLKSAMSCSYVKDYEFVAIFDADFQPNPDFLKRTVPHFKDNDELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGIFLNFFGFNGTAGVWRIKALDDSGGWMERTTVEDMDIAVRAHLRGWKFIFLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCLPDIIKCKIVFWKKANLIFLFFLLRKLILPFYSFTLFCIILPMTMFVPEAELPDWVVCYIPALMSLLNILPSPKSFPFIIPYLLFENTMSVTKFNAMISGLFQLGNAYEWVVTKKSGRSSEGDLISLAPKELKHQKTESAPNLDAIAKEQSAPRKDVKKKLNRIYKKELALSLLLLTAAARSLLSKQGIHFYFLLFQGISFLLVGLDLIGEQIE

 

source of the variation between 595 and 596 aa forms of CslC9: 
...GGGCGGGAAGTTGGCCG---CCGGCGAGAGCCCCGTGCTCCGCTCCCGCTTCTACGCGT TCATCAGGGCATTCGTCGCCCT...

 

>OsCslC10. BAC AP005309, chromosome 7.
MAPWSGFWAASRPALAAAAAGGTPVVVKMDNPNWSISEIDADGGEFLAGGRRRGRGKNAKQITWVLLLKAHRAAGCLAWLASAAVALGAAARRRVAAGRTDDADAETPAPRSRLYAFIRASLLLSVFLLAVELAAHANGRGRVLAASVDSFHSSWVRFRAAYVAPPLQLLADACVVLFLVQSADRLVQCLGCLYIHLNRIKPKPISSPAAAAAALPDLEDPDAGDYYPMVLVQIPMCNEKEVYQQSIAAVCNLDWPRSNILVQVLDDSDDPITQSLIKEEVEKWRQNGARIVYRHRVLREGYKAGNLKSAMSCSYVKDYEYVAIFDADFQPYPDFLKRTVPHFKDNEELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGIFINFFGFNGTAGVWRIKALEDSGGWMERTTVEDMDIAVRAHLNGWKFVFLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCLPDIIRCKIAFWKKANLIFLFFLLRKLILPFYSFTLFCIILPMTMFIPEAELPDWVVCYIPALMSFLNILPAPKSFPFIIPYLLFENTMSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLIALAPKELKQQKILDLTAIKEQSMLKQSSPRNEAKKKYNRIYKKELALSLLLLTAAARSLLSKQGIHFYFLMFQGLSFLLVGLDLIGEDVK

 

 

 

OsCSLD summary

 

 gene

GenBank #

EST #

cDNA

introns

protein

Notes

OsCslD1

AC027037

(Xsome 10)

CA754914

 

1

1127 aa

OSJNBa0035H01
AAL58185

 

OsCslD2

AP001552

(Xsome 6)

JSC EST14

AA753598

(about 18 at TIGR)

AK105393 (4089 bp) AK102134 (3968 bp) AK102695 (2757 bp)

2

1170

Called D4 at TIGR. Os06g02180 

OsCslD3

AP004459

AP008214
(Xsome 8)

AA735599

 

1

1115

BK000093

 

OsCslD4

AL845342
(Xsome 12)

JSC EST9

AU078363

=AU082165

AU082190

=AU082189

AF435644

(1196 bp)

1

1215 

 

OsCslD5

AP005449

AP008212
(Xsome 6)

none?

none?

0

1012

Called D4 at TIGR. Os06g22980980 Os06g22980

 

 

>OsCslD1. 1 intron. 1127 aa
MASKGILKNGGKPPTAPSSAAPTVVFGRRTDSGRFISYSRDDLDSEISSVDFQDYHVHIPMTPDNQPMDPAAGDEQQYVSSSLFTGGFNSVTRAHVMEKQASSARATVSACMVQGCGSKIMRNGRGADILPCECDFKICVDCFTDAVKGGGGVCPGCKEPYKHAEWEEVVSASNHDAINRALSLPHGHGHGPKMERRLSLVKQNGGAPGEFDHNRWLFETKGTYGYGNAIWPEDDGVAGHPKELMSKPWRPLTRKLRIQAAVISPYRLLVLIRLVALGLFLMWRIKHQNEDAIWLWGMSIVCELWFALSWVLDQLPKLCPINRATDLSVLKDKFETPTPSNPTGKSDLPGIDIFVSTADPEKEPVLVTANTILSILAADYPVDKLACYVSDDGGALLTFEAMAEAASFANLWVPFCRKHEIEPRNPDSYFNLKRDPFKNKVKGDFVKDRRRVKREYDEFKVRVNGLPDAIRRRSDAYHAREEIQAMNLQREKMKAGGDEQQLEPIKIPKATWMADGTHWPGTWLQASPEHARGDHAGIIQVMLKPPSPSPSSSGGDMEKRVDLSGVDTRLPMLVYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYVYNSKAFREGMCFMMDRGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRRIALYGFDPPRSKDHTTPWSCCLPRRRRTRSQPQPQEEEEETMALRMDMDGAMNMASFPKKFGNSSFLIDSIPVAEFQGRPLADHPSVKNGRPPGALTIPRETLDASIVAEAISVVSCWYEEKTEWGTRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTHRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNALFASSKMKVLQRIAYLNVGIYPFTSVFLIVYCFLPALSLFSGQFIVQTLNVTFLTYLLIITITLCLLAMLEIKWSGIALEEWWRNEQFWLIGGTSAHLAAVLQGLLKVIAGIEISFTLTSKQLGDDVDDEFAELYAVKWTSLMIPPLTIIMINLVAIAVGFSRTIYSTIPQWSKLLGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVYVWSGLVAITISLLWIAIKPPSAQANSQLGGSFSFP


>OsCslD2. 2 introns. 1170 aa.
MASSGGGGLRHSNSSRLSRMSYSGEDGRAQAPGGGGDRPMVTFARRTHSGRYVSYSRDDLDSELGNSGDMSPESGQEFLNYHVTIPATPDNQPMDPAISARVEEQYVSNSLFTGGFNSVTRAHLMDKVIESEASHPQMAGAKGSSCAINGCDAKVMSDERGDDILPCECDFKICADCFADAVKNGGACPGCKDPYKATELDDVVGARPTLSLPPPPGGLPASRMERRLSIMRSQKAMTRSQTGDWDHNRWLFETKGTYGYGNAIWPKENEVDNGGGGGGGGGLGGGDGQPAEFTSKPWRPLTRKLKIPAGVLSPYRLLILIRMAVLGLFLAWRIKHKNEDAMWLWGMSVVCELWFGLSWLLDQLPKLCPVNRATDLAVLKDKFETPTPSNPNGRSDLPGLDIFVSTADPEKEPPLVTANTILSILAADYPVEKLSCYVSDDGGALLTFEAMAEAASFANMWVPFCRKHDIEPRNPESYFNLKRDPYKNKVRSDFVKDRRRVKREYDEFKVRINSLPDSIRRRSDAYHAREEIKAMKRQREAALDDVVEAVKIPKATWMADGTHWPGTWIQPSAEHARGDHAGIIQVMLKPPSDDPLYGTSGEEGRPLDFTEVDIRLPMLVYVSREKRPGYDHNKKAGAMNALVRSSAVMSNGPFILNLDCDHYVYNSQAFREGMCFMMDRGGDRIGYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGIMGPVYVGTGCLFRRIALYGFDPPRSKEHSGCCSCCFPQRRKVKTSTVASEERQALRMADFDDEEMNMSQFPKKFGNSNFLINSIPIAEFQGRPLADHPGVKNGRPPGALTVPRDLLDASTVAEAISVISCWYEDKTEWGQRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRKMKFLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVRTLNVTFLTYLLVITLTMCMLAVLEIKWSGISLEEWWRNEQFWLIGGTSAHLAAVLQGLLKVIAGIEISFTLTSKSGGDEADDEFADLYIVKWTSLMIPPIVIMMVNLIAIAVGFSRTIYSEIPQWSKLLGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLLAITISLLWVAINPPSQNSQIGGSFTFP


>OsCslD3. 1 intron. 1115 aa. MSTGPGKKAIRNAGGVGGGAGPSAGGPRGPAGQAVKFARRTSSGRYVSLSREDIDMEGELAADYTNYTVQIPPTPDNQPMLNGAEPASVAMKAEEQYVSNSLFTGGFNSATRAHLMDKVIESSVSHPQMAGAKGSRCAMPACDGSAMRNERGEDVDPCECHFKICRDCYLDAQKDGCICPGCKEHYKIGEYADDDPHDGKLHLPGPGGGGNKSLLARNQNGEFDHNRWLFESSGTYGYGNAFWPKGGMYDDDLDDDVDKLGGDGGGGGGGGPLPEQKPFKPLTRKIPMPTSVISPYRIFIVIRMFVLLFYLTWRIRNPNMEALWLWGMSIVCELWFAFSWLLDMLPKVNPVNRSTDLAVLKEKFETPSPSNPHGRSDLPGLDVFVSTADPEKEPVLTTATTILSILAVDYPVEKLACYVSDDGGALLTFEAMAEAASFANVWVPFCKKHDIEPRNPDSYFSVKGDPTKGKRRNDFVKDRRRVKREFDEFKVRINGLPDSIRRRSDAFNAREDMKMLKHLRETGADPSEQPKVKKATWMADGSHWPGTWAASAPDHAKGNHAGILQVMLKPPSPDPLYGMHDDDQMIDFSDVDIRLPMLVYMSREKRPGYDHNKKAGAMNALVRCSAVMSNGPFMLNFDCDHYINNAQAVREAMCFFMDRGGERIAYIQFPQRFEGIDPSDRYANNNTVFFDGNMRALDGLQGPMYVGTGCMFRRFAVYGFDPPRTAEYTGWLFTKKKVTTFKDPESDTQTLKAEDFDAELTSHLVPRRFGNSSPFMASIPVAEFQARPLADHPAVLHGRPSGALTVPRPPLDPPTVAEAVSVISCWYEDKTEWGDRVGWIYGSVTEDVVTGYRMHNRGWRSVYCITKRDAFLGTAPINLTDRLHQVLRWATGSVEIFFSRNNAFLASRKLMLLQRISYLNVGIYPFTSIFLLVYCFIPALSLFSGFFIVQKLDIAFLCYLLTMTITLVALGILEGLLKVMAGIEISFTLTAKAAADDNEDIYADLYIVKWSSLLIPPITIGMVNIIAIAFAFARTIYSDNPRWGKFIGGGFFSFWVLAHLNPFAKGLMGRRGKTPTIVFVWSGLLSITVSLLWVAISPPEANSNGGARGGGFQFP


>OsCslD4. 1 intron. 1215 aa. MSRRLSLPAGAPVTVAVSPVRSPGGDAVVRRGSGLTSPVPRHSLGSSTATLQVSPVRRSGGSRYLGASRDGGADESAEFVHYTVHIPPTPDRATASVASEAEAAAEAEEVHRPQRSYISGTIFTGGLNCATRGHVLNFSGEGGATAASRAAASGNMSCKMRGCDMPAFLNGGRPPCDCGFMICKECYAECAAGNCPGCKEAFSAGSDTDESDSVTDDDDDEAVSSSEERDQLPLTSMARKFSVVHSMKVPGAAANGNGKPAEFDHARWLFETKGTYGYGNALWPKDGHAHSGAGFVAADEPPNFGARCRRPLTRKTSVSQAILSPYRLLIAIRLVALGFFLAWRIRHPNPEAVWLWAMSVACEVWFAFSWLLDSLPKLCPVHRAADLAVLAERFESPTARNPKGRSDLPGIDVFVTSADPEKEPPLVTANTILSILAADYPVEKLACYLSDDGGALLSFEALAETASFARTWVPFCRKHGVEPRCPEAYFGQKRDFLKNKVRVDFVRERRKVKREYDEFKVRVNSLPEAIRRRSDAYNAGEELRARRRQQEEAAAAAAAGNGELGAAAVETAAVKATWMSDGSHWPGTWTCPAADHARGDHAGIIQAMLAPPTSEPVMGGEAAECGGLIDTTGVDVRLPMLVYVSREKRPGYDHNKKAGAMNALVRTSAIMSNGPFILNLDCDHYVHNSSALREGMCFMLDRGGDRVCFVQFPQRFEGVDPSDRYANHNLVFFDVSMRAMDGLQGPMYVGTGCVFRRTALYGFSPPRATEHHGWLGRRKIKLFLTKKKSMGKKTDRAEDDTEMMLPPIEDDDGGADIEASAMLPKRFGGSATFVASIPVAEYQGRLLQDTPGCHHGRPAGALAVPREPLDAATVAEAIGVISCFYEEKTEWGRRIGWIYGSVTEDVVTGYRMHNRGWRSVYCVTPRRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNALFASPRMKLLQRVAYFNAGMYPFTSVFLLAYCLLPAVSLFSGKFIVQRLSATFLAFLLVITLTLCLLALLEIKWSGITLHEWWRNEQFWVIGGTSAHPAAVLQGLLKVIAGVDISFTLTSKPGNGGGDGGVGGEGNDDEAFAELYEVRWSYLMVPPVTIMMVNAVAIAVAAARTLYSEFPQWSKLLGGAFFSFWVLCHLYPFAKGLLGRRGRVPTIVFVWSGLISMIISLLWVYINPPAGARERIGGGGFSFP


>OsCslD5. 0 intron. 1012 aa
MSVDYANYTVLMPPTPDNQPSGGAPPAAPSAGGARPGDLPLPPYGSSSSSRLVNRRGGGDDGAKMDRRLSTARVPAPSSNKSLLVRSQTGDFDHNRWLFETKGTYGIGNAYWPQDNVYGDDGGGGAVKMEDLVEKPWKPLSRKVPIPPGILSPYRLLVLVRFVALFLFLVWRVTNPNMDALWLWGISIVCEFWFAFSWLLDQMPKLNPINRAADLAALKEKFESPSPTNPTGRSDLPGLDVFISTADPYKEPTLVTANTLLSILATEYPVEKLFVYISDDGGALLTFESMAEACAFAKVWVPFCRKHSIEPRNPDSYFTQKGDPTKGKKRPDFVKDRRWIKREYDEFKIRVNSLPDLIRRRANALNARERKLARDKQAAGDADALASVKAATWMADGTHWPGTWLDPSPDHAKGDHASIVQVMIKNPHHDVVYGEAGDHPYLDMTDVDMRIPMFAYLSREKRAGYDHNKKAGAMNAMVRASAILSNGPFMLNFDCDHYIYNCQAIREAMCYMLDRGGDRICYIQFPQRFEGIDPSDRYANHNTVFFDGNMRALDGLQGPMYVGTGCLFRRYAIYGFNPPRAIEYRGTYGQTKVPIDPRQGSEAMPGAGGGRSGGGSVGGDHELQALSTAHPDHEAPQKFGKSKMFIESIAVAEYQGRPLQDHPSVLNGRPPGALLMPRPPLDAATVAESVSVISCWYEDNTEWGQRLGWIYGSVTEDVVTGYRMHNRGWRSVYCITRRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSKNNAVLASRRLKFLQRMAYLNVGIYPFTSLFLIMYCLLPALSLFSGQFIVATLDPTFLSYLLLITITLMLLCLLEVKWSGIGLEEWWRNEQFWVIGGTSAHLAAVLQGLLKVVAGIEISFTLTAKAAAEDDDDPFAELYLIKWTSLFIPPLAVIGINIIALVVGVSRTVYAEIPQYSKLLGGGFFSFWVLAHYYPFAKGLMGRRGRTPTIVYVWAGLISITVSLLWITISPPDDSVAQGGIDV  

 

 

OsCslE summary

 

 

Of the original “E” genes (Hazen et al.), only OsCslE1, E2, and E6 appear at this point to be valid genes. The original E3, E4, or E5 are not valid.

 

Intron borders are conserved. OsCslE1 and E6 each have six introns whose boundaries are the same with each other and with six of the seven introns in E2. (E2 has one extra intron.)

 

>OsCslE1. 737 aa.   cDNA AK102766
METTAAATAAERRRPLFTTEELGGRAVYRVQAATVAAGILLVLYYRATRVPAAGEGRAAWLGMAAAELWFAVYWVIAQSVRWRPFRRRTFRDRLAERYEQNLPGVDIFVCTADPQSEPPSLVISTILSVMAYNYPSEKISVYLSDDGGSILTFYALWEASIFAKKWLPFCKRYNIEPRSPAAYFSESKVHHNLCIPKEWALIKNLYEEMRERIDTATMSGKIPEEMKLKHKGFDEWNSDFTLKNHQPIVQILIDGKNRNAIDDDRNVLPTMVYVAREKRPQYHHNFKAGALNALIRVSSVISDSPVILNVDCDMYSNNSDSIRDALCFFLDEEMGQKIGFVQYPQIFNNMTQNDIYGNSFNVSYHVEMCGLDSVGGCLYIGTGCFHRREILCGRIFSKDYKENWNRGIKERGKENINEIEEKATSLVTCTYEHRTQWGNDIGVKYGFPAEDIITGLAIHCRGWESAFINPKRAAFLGLAPSTLAQNILQHKRWSEGNLTIFLSKYCSFLFGHGKIKLQLQMGYCICGLWAANSLPTLYYVVIPSLGLVKGTPLFPQIMSPWATPFIYVFCVKTLYGLYEALLSGDTLKGWWNGQRMWMVKSITSYLYGFIDTIRKCVGMSKMSFEVTAKVSGHDEAKRYEQEILEFGSSSPEYVIIATVALLNFVCLVGGLSQIMAGVWNMPWNVFLPQAILCGMIVIINMPIYEAMFLRKDNGRIPTAVTLASIGFVMLAFLVPIV

>OsCslE2. AAL25130. AK101487
MAGSGGGVVSGGRQRGPPLFATEKPGRMAMAAYRVSAATVFAGVLLIWLYRATHLPPGGGDGVRRWAWLGMLAAELWFGFYWVLTLSVRWCPVYRRTFKDRLAQSYSEDELPSVDIFVCTADPTAEPPMLVISTVLSVMAYDYLPEKLNIYLSDDAGSVLTFYVLCEASEFAKHWIPFCKKYKVEPRSPAAYFAKVASPPDGCGPKEWFTMKELYKDMTDRVNSVVNSGRIPEVPRCHSRGFSQWNENFTSSDHPSIVQilidsnkqkavdidgnalptlvymarekkpqkqhhfkagslnalIRVSSVISNSPIIMNVDCDMYSNNSESIRDALCFFLDEEQGQDIGFVQYPQNFENVVHNDIYGHPINVVNELDHPCLDGWGGMCYYGTGCFHRREALCGRIYSQEYKEDWTRVAGRTEDANELEEMGRSLVTCTYEHNTIWGIEKGVRYGCPLEDVTTGLQIQCRGWRSVYYNPKRKGFLGMTPTSLGQILVLYKRWTEGFLQISLSRYSPFLLGHGKIKLGLQMGYSVCGFWAVNSFPTLYYVTIPSLCFLNGISLFPEKTSPWFIPFAYVMVAAYSCSLAESLQCGDSAVEWWNAQRMWLIRRITSYLLATIDTFRRILGISESGFNLTVKVTDLQALERYKKGMMEFGSFSAMFVILTTVALLNLACMVLGISRVLLQESPGGLETLFLQAVLCVLIVAINSPVYEALFLRRDKGSLPASVARVSICFVLPLCILSICK

 

>OsCslE6. 718 aa
METTTTERRRLFATEKVGGRAVYRLQAATVAAGILLVLYYRATRVPAAGEGRAAWLGMAAAELWFAVYWVITQSVRWCPVRRRTFKNRLAERYKENLPGVDVFVCTADPHAEPPSLVISTILSVMAYNYPSEKISVYLSDDGGSILTFYALWEASMFAKKWLPFCRRYNIEPRSPAAYFSESEGHHNLCSPKEWSFIKRIDSAVMSGKIPEEFKLKMKGFDEWNSEMTSKNHQPIVLIDGKSQNAVDDDGNVLPTLVYMAREKSPQYHHNFKAGALNALIRVSALISDSPVILNVDCDMYSNNSDSIRDALCFFLDEEMSHKIGFVQYPQNYNNMTKNNIYGNSLNVINHVEMRGLDSAGGCLYIGTGCFHRREILCGKKFSKDYKEDWGRGIKERGHENIDEIEEKAKSLATCTYELRTQWGNEIGVKYGCPVEDVITGLAIHCRGWESVYMEPQRAAFVGVAPATLAQTILQHKRWSEGNFTIFLSKHNTFLFGHGKISLQLQMGYCIYGLWAANSLPTIYYVMIPALGLVKGTPLFPEIMSPWATPFIYVFCVKTLYSLYEALLSGDTLKGWWNGQRMWMVKRITSYLYGFIDTIRKLLGLSKMSFEITAKVSDGDEAKRYEQEILEFGSSSPEFVIIATVALLNFVCLVAGLSKIMAGVWNVFLPQVILCGLIVITNIPIYEAMFVRKDKGRIPLPVTLASIGFVMLAFLLPIV

 

 

 

 

OsCslF summary

 

There are eight CSLF genes in rice: F1, F2, F3, F4, F6, F7, F8, F9. All have one or two introns. Intron positions are conserved.

 

F1 and F2 proteins are very similar.

 

F1, F2, F3, and F4 are tightly clustered. F1-4 and F8 and F9 are on all on the same contig composed of AP004261 + AP005126 = 270 kb

 

The original OsCslF5 is a pseudogene. The Syngenta sequence (CLC6583) contains the same frame shifts and in-frame stop codon(s) found in the Monsanto sequence, and lacks any detectable carboxy terminus.

 

The reported sequence of the cDNA for F6 probably has errors in it, based on alignment with Syngenta genomic sequence.

 

The long N-terminal intron of F7 is probably real because amino acids flanking the first intron boundary are conserved.

 

Gene

intron 1

intron 2

aa length

GenBank BAC

cDNA

OsCslF1

no

yes

860

AP004261

(Xsome 7)  

 

OsCslF2

no

yes

889

AP004261

AP005126 

(Xsome 7)

 

OsCslF3

yes

yes

868

AP004261

(Xsome 7) 

 

OsCslF4

yes

yes

897

AP004261

(Xsome 7)  

 

OsCslF6

yes

yes

952

AP004635

(Xsome 8)

AF435645

AK065259

OsCslF7

yes

no

830

AC090441
AP008216

(Xsome 10)

NM_195879
AK110467

BK000091

OsCslF8

yes

yes

886

AP005126

(Xsome 7) 

 

OsCslF9

yes

yes

884

AP005126

(Xsome 7) 

 


 

>OsCslF1. 860 aa.

MSAAAAVTSWTNGCWSPAATRVNDGGKDDVWVAVDEADVSGARGSDGGGRPPLFQTYKVKGSILHPYRFLILARLIAIVAFFAWRIRHKNRDGAWLWTMSMVGDVWFGFSWVLNQLPKQSPIKRVPDIAALADRHSGDLPGVDVFVTTVDPVDEPILYTVNTILSILAADYPVDRYACYLSDDGGTLVHYEAMVEVAKFAELWVPFCRKHCVEPRSPENYFAMKTQAYKGGVPGELMSDHRRVRREYEEFKVRIDSLSSTIRQRSDVYNAKHAGENATWMADGTHWPGTWFEPADNHQRGKHAGIVQVLLNHPSCKPRLGLAASAENPVDFSGVDVRLPMLVYISREKRPGYNHQKKAGAMNVMLRVSALLSNAPFVINFDGDHYVNNSQAFRAPMCFMLDGRGRGGENTAFVQFPQRFDDVDPTDRYANHNRVFFDGTMLSLNGLQGPSYLGTGTMFRRVALYGVEPPRWGAAASQIKAMDIANKFGSSTSFVGTMLDGANQERSITPLAVLDESVAGDLAALTACAYEDGTSWGRDVGWVYNIATEDVVTGFRMHRQGWRSVYASVEPAAFRGTAPINLTERLYQILRWSGGSLEMFFSHSNALLAGRRLHPLQRVAYLNMSTYPIVTVFIFFYNLFPVMWLISEQYYIQRPFGEYLLYLVAVIAMIHVIGMFEVKWAGITLLDWCRNEQFYMIGSTGVYPTAVLYMALKLVTGKGIYFRLTSKQTAASSGDKFADLYTVRWVPLLIPTIVIMVVNVAAVGVAVGKAAAWGPLTEPGWLAVLGMVFNVWILVLLYPFALGVMGQWGKRPAVLFVAMAMAVAAVAAMYVAFGAPYQAELSGVAASLGKVAAASLTGPSG

 

>OsCslF2. 889 aa.

MAATAASTMSAAAAVTRRINAALRVDATSGDVAAGADGQNGRRSPVAKRVNDGGGGKDDVWVAVDEKDVCGARGGDGAARPPLFRTYKVKGSILHPYRFLILLRLIAIVAFFAWRVRHKNRDGVWLWTMSMVGDVWFGFSWVLNQLPKLSPIKRVPDLAALADRHSGDLPGVDVFVTTVDPVDEPILYTVNTILSILAADYPVDRYACYLSDDGGTLVHYEAMVEVAKFAELWVPFCRKHCVEPRSPENYFAMKTQAYKGGVPGELMSDHRRVRREYEEFKVRIDSLSSTIRQRSDVYNAKHAGENATWMADGTHWPGTWFEPADNHQRGKHAGIVQVLLNHPSCKPRLGLAASAENPVDFSGVDVRLPMLVYISREKRPGYNHQKKAGAMNVMLRVSALLSNAPFVINFDGDHYVNNSQAFRAPMCFMLDGRGRGGENTAFVQFPQRFDDVDPTDRYANHNRVFFDGTMLSLNGLQGPSYLGTGTMFRRVALYGVEPPRWGAAASQIKAMDIANKFGSSTSFVGTMLDGANQERSITPLAVLDESVAGDLAALTACAYEDGTSWGRDVGWVYNIATEDVVTGFRMHRQGWRSVYASVEPAAFRGTAPINLTERLYQILRWSGGSLEMFFSHSNALLAGRRLHPLQRVAYLNMSTYPIVTVFIFFYNLFPVMWLISEQYYIQRPFGEYLLYLVAVIAMIHVIGMFEVKWAGITLLDWCRNEQFYMIGSTGVYPTAVLYMALKLVTGKGIYFRLTSKQTTASSGDKFADLYTVRWVPLLIPTIVIIVVNVAAVGVAVGKAAAWGPLTEPGWLAVLGMVFNVWILVLLYPFALGVMGQWGKRPAVLFVAMAMAVAAVAAMYVAFGAPYQAELSGGAASLGKAAASLTGPSG

 

>OsCslF3. 868 aa.

MASPASVAGGGEDSNGCSSLIDPLLVSRTSSIGGAERKAAGGGGGGAKGKHWAAADKGERRAAKECGGEDGRRPLLFRSYRVKGSLLHPyrALIFARLIAVLLFFGWRIRHNNSDIMWFWTMSVAGDVWFGFSWLLNQLPKFNPVKTIPDLTALRQYCDLADGSYRLPGIDVFVTTADPIDEPVLYTMNCVLSILAADYPVDRSACYLSDDSGALILYEALVETAKFATLWVPFCRKHCIEPRSPESYFELEAPSYTGSAPEEFKNDSRIVHLEYDEFKVRLEALPETIRKRSDVYNSMKTDQGAPNATWMANGTQWPGTWIEPIENHRKGHHAGIVKVVLDHPIRGHNLSLKDSTGNNLNFNATDVRIPMLVYVSRGKNPNYDHNKKAGALNAQLRASALLSNAQFIINFDCDHYINNSQAFRAAICFMLDQREGDNTAFVQFPQRFDNVDPKDRYGNHNRVFFDGTMLALNGLQGPSYLGTGCMFRRLALYGIDPPHWRQDNITPEASKFGNSILLLESVLEALNQDRFATPSPVNDIFVNELEMVVSASFDKETDWGKGVGYIYDIATEDIVTGFRIHGQGWRSMYCTMEHDAFCGTAPINLTERLHQIVRWSGGSLEMFFSHNNPLIGGRRLQPLQRVSYLNMTIYPVTSLFILLYAISPVMWLIPDEVYIQRPFTRYVVYLLVIILMIHMIGWLEIKWAGITWLDYWRNEQFFMIGSTSAYPTAVLHMVVNLLTKKGIHFRVTSKQTTADTNDKFADLYEMRWVPMLIPTMVVLVANIGAIGVAIGKTAVYMGVWTIAQKRHAAMGLLFNMWVMFLLYPFALAIMGRWAKRSIILVVLLPIIFVIVALVYVATHILLANIIPF

 

>OsCslF4. 897 aa.
MELATASTMSAAAVTRRINAGGLRVEVTNGNGAAGVYVAAAAAPCSPAAKRVNDGGGKDDVWVAVDEADVSGPSGGDGVRPTLFRTYKVKGSILHPYRFLILVRLIAIVAFFAWRVRHKNRDGAWLWTMSMAGDVWFGFSWALNQLPKLNPIKRVADLAALADRQQHGTSGGGELPGVDVFVTTVDPVDEPILYTVNSILSILAADYPVDRYACYLSDDGGTLVHYEAMVEVAKFAELWVPFCRKHCVEPRAPESYFAMKTQAYRGGVAGELMSDRRRVRREYEEFKVRIDSLFSTIRKRSDAYNRAKDGKDDGENATWMADGTHWPGTWFEPAENHRKGQHAGIVQVLLNHPTSKPRFGVAASVDNPLDFSGVDVRLPMLVYISREKRPGYNHQKKAGAMNALLRVSALLSNAPFIINFDCDHYVNNSQAFRAPMCFMLDRRGGGDDVAFVQFPQRFDDVDPTDRYANHNRVFFDGTTLSLNGLQGPSYLGTGTMFRRAALYGLEPPRWGAAGSQIKAMDNANKFGASSTLVSSMLDGANQERSITPPVAIDGSVARDLAAVTACGYDLGTSWGRDAGWVYDIATEDVATGFRMHQQGWRSVYTSMEPAAFRGTAPINLTERLYQILRWSGGSLEMFFSHSNALLAGRRLHPLQRIAYLNMSTYPIVTVFIFFYNLFPVMWLISEQYYIQQPFGEYLLYLVAIIAMIHVIGMFEVKWSGITVLDWCRNEQFYMIGSTGVYPTAVLYMALKLFTGKGIHFRLTSKQTTASSGDKFADLYTVRWVPLLIPTIVVLAVNVGAVGVAVGKAAAWGLLTEQGRFAVLGMVFNVWILALLYPFALGIMGQRGKRPAVLFVATVMAVAAVAIMYAAFGAPYQAGLSGVAASLGKAASLTGPSG

 

>OsCslF6. 2 introns. 952 aa
MAPAVAGGGGRRNNEGVNGNAAAPACVCGFPVCACAGAAAVASAASSADMDIVAAGQIGAVNDESWVAVDLSDSDDAPAAGDVQGALDDRPVFRTEKIKGVLLHPYR[intron]VLIFVRLIAFTLFVIWRIEHKNPDAMWLWVTSIAGEFWFGFSWLLDQLPKLNPINRVPDLAVLRRRFDHADGTSSLPGLDIFVTTADPIKEPILSTANSILSILAADYPVDRNTCYLSDDSGMLLTYEAMAEAAKFATLWVPFCRKHAIEPRGPESYFELKSHPYMGRAQEEFVNDRRRVRKEYDDFKARINGLEHDIKQRSDSYNAAAGVKDGEPRATWMADGSQWEGTWIEQSENHRKGDHAGIVL[intron]VLLNHPSHARQLGPPASADNPLDFSGVDVRLPMLVYVAREKRPGCNHQKKAGAMNALTRASAVLSNSPFILNLDCDHYINNSQALRAGICFMLGRDSDTVAFVQFPQRFEGVDPTDLYANHNRIFFDGTLRALDGLQGPIYVGTGCLFRRITLYGFEPPRINVGGPCFPRLGGMFAKNRYQKPGFEMTKPGAKPVAPPPAATVAKGKHGFLPMPKKAYGKSDAFADTIPRASHPSPYAAEAAVAADEAAIAEAVMVTAAAYEKKTGWGSDIGWVYGTVTEDVVTGYRMHIKGWRSRYCSIYPHAFIGTAPINLTERLFQVLRWSTGSLEIFFSRNNPLFGSTFLHPLQRVAYINITTYPFTALFLIFYTTVPALSFVTGHFIVQRPTTMFYVYLAIVLGTLLILAVLEVKWAGVTVFEWFRNGQFWMTASCSAYLAAVLQVVTKVVFRRDISFKLTSKLPAGDEKKDPYADLYVVRWTWLMITPIIIILVNIIGSAVAFAKVLDGEWTHWLKVAGGVFFNFWVLFHLYPFAKGILGKHGKTPVVVLVWWAFTFVITAVLYINIPHIHGPGRHGAASPSHGHHSAHGTKKYDFTYAWP

 

>OsCslF7

MPPSAGLATESLPAATCPAKKDAYAAAASPESETKLAAGDERAPLVRTTRISTTTIKLYRLTIFVRIAIFVLFFKWRITYAARAISSTDAGGIGMSKAATFWTASIAGELWFAFMWVLDQLPKTMPVRRAVDVTALNDDTLLPAMDVFVTTADPDKEPPLATANTVLSILAAGYPAGKVTCYVSDDAGAEVTRGAVVEAARFAALWVPFCRKHGVEPRNPEAYFNGGEGGGGGGKARVVARGSYKGRAWPELVRDRRRVRREYEEMRLRIDALQAADARRRRCGAADDHAGVVQVLIDSAGSAPQLGVADGSKLIDLASVDVRLPALVYVCREKRRGRAHHRKAGAMNALLRASAVLSNAPFILNLDCDHYVNNSQALRAGICFMIERRGGGAEDAGDVAFVQFPQRFDGVDPGDRYANHNRVFFDCTELGLDGLQGPIYVGTGCLFRRVALYGVDPPRWRSPGGGVAADPAKFGESAPFLASVRAEQSHSRDDGDAIAEASALVSCAYEDGTAWGRDVGWVYGTVTEDVATGFCMHRRGWRSAYYAAAPDAFRGTAPINLADRLHQVLRWAAGSLEIFFSRNNALLAGGRRRLHPLQRAAYLNTTVYPFTSLFLMAYCLFPAIPLIAGGGGWNAAPTPTYVAFLAALMVTLAAVAVLETRWSGIALGEWWRNEQFWMVSATSAYLAAVAQVALKVATGKEISFKLTSKHLASSATPVAGKDRQYAELYAVRWTALMAPTAAALAVNVASMAAAGGGGRWWWWDAPSAAAAAAAALPVAFNVWVVVHLYPFALGLMGRRSKAVRPILFLFAVVAYLAVRFLCLLLQFHTA

 

 

>OsCslF8. 2 introns.  886 aa
MAANGGGGGAGGCSNGGGGGAVNGAAANGGGGGGGGSKGATTRRAKVSPMDRYWVPTDEKEMAAAVADGGEDGRRPLLFRTFTVRGILLHPYRLLTLVRLVAIVLFFIWRIRHPYADGMFFWWISVIGDFWFGVSWLLNQVAKLKPIRRVPDLNLLQQQFDLPDGNSNLPGLDVFINTVDPINEPMIYTMNAILSILAADYPVDKHACYLSDDGGSIIHYDGLLETAKFAALWVPFCRKHSIEPRAPESYFAVKSRPYAGSAPEDFLSDHRYMRREYDEFKVRLDALFTVIPKRSDAYNQAHAEEGVKATWMADGTEWPGTWIDPSENHKKGNHAGIVQVMLNHPSNQPQLGLPASTDSPVDFSNVDVRLPMLVYIAREKRPGYDHQKKAGAMNVQLRVSALLTNAPFIINFDGDHYVNNSKAFRAGICFMLDRREGDNTAFVQFPQRFDDVDPTDRYCNHNRVFFDATLLGLNGIQGPSYVGTGCMFRRVALYGVDPPRWRPDDGNIVDSSKKFGNLDSFISSIPIAANQERSIISPPALEESILQELSDAMACAYEDGTDWGKDVGWVYNIATEDVVTGFRLHRTGWRSMYCRMEPDAFRGTAPINLTERLYQILRWSGGSLEMFFSHNCPLLAGRRLNFMQRIAYINMTGYPVTSVFLLFYLLFPVIWIFRGIFYIQKPFPTYVLYLVIVIFMSEMIGMVEIKWAGLTLLDWIRNEQFYIIGATAVYPLAVLHIVLKCFGLKGVSFKLTAKQVASSTSEKFAELYDVQWAPLLFPTIVVIAVNICAIGAAIGKALFGGWSLMQMGDASLGLVFNVWILLLIYPFALGIMGRWSKRPYILFVLIVISFVIIALADIAIQAMRSGSVRLHFRRSGGANFPTSWGF

 

>OsCslF9. 2 introns.   884 aa.  protein BAC80027.
MALSPAAAGRTGRNNNNDAGLADPLLPAGGGGGGGKDKYWVPADEEEEICRGEDGGRPPAPPLLYRTFKVSGVLLHPYRLLTLVRLIAVVLFLAWRLKHRDSDAMWLWWISIAGDFWFGVTWLLNQASKLNPVKRVPDLSLLRRRFDDGGLPGIDVFINTVDPVDEPMLYTMNSILSILATDYPADRHAAYLSDDGASLAHYEGLIETARFAALWVPFCRKHRVEPRAPESYFAAKAAPYAGPALPEEFFGDRRLVRREYEEFKARLDALFTDIPQRSEASVGNANTKGAKATLMADGTPWPGTWTEPAENHKKGQHAGIVKVMLSHPGEEPQLGMPASSGHPLDFSAVDVRLPILVYIAREKRPGYDHQKKAGAMNAQLRVSALLSNAPFIFNFDGDHYINNSQAFRAALCFMLDCRHGDDTAFVQFPQRFDDVDPTDRYCNHNRVFFDATLLGLNGVQGPSYVGTGCMFRRVALYGADPPRWRPEDDDAKALGCPGRYGNSMPFINTIPAAASQERSIASPAAASLDETAAMAEVEEVMTCAYEDGTEWGDGVGWVYDIATEDVVTGFRLHRKGWRSMYCAMEPDAFRGTAPINLTERLYQILRWSGGSLEMFFSRNCPLLAGCRLRPMQRVAYANMTAYPVSALFMVVYDLLPVIWLSHHGEFHIQKPFSTYVAYLVAVIAMIEVIGLVEIKWAGLTLLDWWRNEQFYMIGATGVYLAAVLHIVLKRLLGLKGVRFKLTAKQLAGGARERFAELYDVHWSPLLAPTVVVMAVNVTAIGAAAGKAVVGGWTPAQVAGASAGLVFNVWVLVLLYPFALGIMGRWSKRPCALFALLVAACAAVAAGFVAVHAVLAAGSAAPSWLGWSRGATAILPSSWRLKRGF

 

 

OsCSLH summary

 

 

1. OsCslH2 and OsCslH3 are both on BAC AL606632

 

2. Intron structure is conserved. Seven common introns are in the same locations in all three genes. CslH1 has one additional intron in its N terminus, and CslH2 lacks an exon present in both CslH2 and CslH3 (this exon is indicated in bold in the sequence of H3).

 

3. The CslH proteins are similar in size and sequence, and they group together and separately from the Arabidopsis CslB's.

 

 

>OsCslH1   8 introns.   750 aa

MEAAARGNKKLQERVPIRRTAWRLADLAILFLLLALLLHRVLHDSGAPWRRAALACEAWFTFMWLLNVNAKWSPVRFDTFPENLAERIDELPAVDMFVTTADPVLEPPLVTVNTVLSLLALDYPAAGEKLACYVSDDGCSPLTCYALREAARFARTWVPFCRRHGVAVRAPFRYFSSTPEFGPADGKFLEDWTFMKSEYEKLVHRIEDADEPSLLRHGGGEFAEFLDVERGNHPTIIKVLWDNNRSRTGDGFPRLIYVSREKSPNLHHHYKAGAMNALTRVSALMTNAPFMLNLDCDMFVNNPRVVLHAMCLLLGFDDEISCAFVQTPQKFYGALKDDPFGNQLEVSLMKVGRGIAGLQGIFYCGTGCFHRRKVIYGMRTGREGTTGYSSNKELHSKFGSSNNFKESARDVIYGNLSTEPIVDISSCVDVAKEVAACNYEIGTCWGQEVGWVYGSLTEDVLTGQRIHAAGWRSTLMEIEPPAFMGCAPNGGPACLTQLKRWASGFLEILISRNNPILTTTFKSLQFRQCLAYLHSYVWPVRAPFELCYALLGPYCLLSNQSFLPKTSEDGFYIALALFIAYNTYMFMEFIECGQSARACWNNHRMQRITSASAWLLAFLTVILKTLGFSETVFEVTRKDKSTSDGDSNTDEPEPGRFTFDESTVFIPVTALAMLSVIAIAVGAWRVVLVTTEGLPGGPGISEFISCGWLVLCFMPLLRGLVGSGRYGIPWSIKMKACLLVAIFLLFCKRN

 

>OsCslH2. 762 aa. 7 introns.

MAVVAAAAATGSTTRSGGGGGEGTRSGRKKPPPPPLQERVPLGRRAAWAWRLAGLAVLLLLLALLALRLLRHHGGAGGDAGVWRVALVCEAWFAALCALNVSAKWSPVRFVTRPENLVAEGRTPSTTAAEYGELPAVDMLVTTADPALEPPLVTVNTVLSLLALDYPRAGERLACYVSDDGCSPLTCHALREAAGFAAAWVPFCRRYGVAVRAPFRYFSSSSSPESGGPADRKFLDDWTFMKDEYDKLVRRIKNTDERSLLRHGGGEFFAEFLNVERRNHPTIVKTRVSAVMTNAPIMLNMDCDMFVNNPQAVLHAMCLLLGFDDEASSGFVQAPQRFYDALKDDPFGNQMECFFKRFISGVQGVQGAFYAGTGCFHRRKAVYGVPPNFNGAEREDTIGSSSYKELHTRFGNSEELNESARNIIWDLSSKPMVDISSRIEVAKAVSACNYDIGTCWGQEVGWVYGSLTEDILTGQRIHAMGWRSVLMVTEPPAFMGSAPIGGPACLTQFKRWATGQSEIIISRNNPILATMFKRLKFRQCLAYLIVLGWPLRAPFELCYGLLGPYCILTNQSFLPKASEDGFSVPLALFISYNTYNFMEYMACGLSARAWWNNHRMQRIISVSAWTLAFLTVLLKSLGLSETVFEVTGKDKSMSDDDDNTDGADPGRFTFDSLPVFIPVTALAMLNIVAVTVGACRVAFGTAEGVPCAPGIGEFMCCGWLVLCFFPFVRGIVWGKGSYGIPWSVKLKASLLVAMFVTFCKRN

 

>OsCslH3  7 introns.     792 aa
MAAASGEKEEEEKKLQERAPIRRTAWMLANFVVLFLLLALLVRRATAADAEERGVGGAAWRVAFACEAWFAFVWLLNMNAKWSPARFDTYPENLAGRCGAAHRPRKSSCISGHLDLMRRQCALMQDRRAAGGRHVRDDGGPGARAAGGDGEQGALAARRRLLPGRRRRRRRRRLACYVSDDGCSPVTYYALREAAGFARTWVPFCRRHGVAVRAPFRYFASAPEFGPADRKFLDDWTFMK[intron]SEYDKLVRRIEDADETTLLRQGGGEFAEFMDAKRTNHRAIVK[intron]VIWDNNSKNRIGEEGGFPHLIYVSREKSPGHHHHYKAGAMNAL[intron]TRVSAVMTNAPIMLNVDCDMFANDPQVVLHAMCLLLGFDDEISSGFVQVPQSFYGDLKDDPFGNKLEVIYK[intron]GLFYGGTGCFHCRKAIYGIEPDSIVVGREGAA[intron]GSPSYKELQFKFESSEELKESARYIISGDMSGEPIVDISSHIEVAKEVSSCNYESGTHWGLE[intron]VGWAYGSMTEDILTGQRIHAAGWRSAKLETEPPAFLGCAPTGGPACLTQFKRWATGLFEILISQNNPLLLSIFKHLQFRQCLAYLTLYVWAVRGFVELCYELLVPYCLLTNQSFLSK[intron]ASENCFNITLALFLTYNTYNFVEYMECGLSVRAWWNNHRMQRIISASAWLLAFFTVLLKTIGLSETVFEVTRKEKSTSDGNGQNDEVDPERFTFDASPVFIPVTALTMLNIVAITIGTWRAVFGTTEDVPGGPGISEFMSCGWLLLCLLPFVRGLVGKGSYGIPWSVKLKASLLVALFLFCSNRN

 

_____________________________________________________________________________

 

Full length cDNAs for rice CSL genes: see Science 301:376.

OsCslA1

AK102694 full length

AK059580

2 sequences are identical; AK102694 is longer.

OsCslA2

 

 

OsCslA3

 

 

OsCslA5

AK111424

 

OsCslA6

 

 

OsCslA7

AK064833

 

OsCslA9

 

 

OsCslA11

 

 

OsCslA12**

AK107635

New. Probably pseudogene [see below]**

OsCslC1

AK110759

 

 

OsCslC2

 

 

OsCslC3

AK108045

 

OsCslC7

 

 

OsCslC9

 

 

OsCslC10

 

 

OsCslD1

 

 

OsCslD2

AK105393 (4089 bp) AK102134 (3968 bp) AK102695 (2757 bp)

All 3 are identical over their shared range. AK102134 extends furthest upstream.

OsCslD3

 

 

OsCslD4

 

 

OsCslD5

AK072260

 

OsCslE1

AK102766

several discrepancies; needs more work

OsCslE2

AK101487

 

OsCslE6

 

 

OsCslF1

 

 

OsCslF2

AK100523, AK066835

AK100523 has 1 aa discrepancy, AK066835 is exact match.

OsCslF3

 

 

OsCslF4

 

 

OsCslF6

AK065259

 

OsCslF7

AK110467

1 aa discrepancy at C terminus

OsCslF8

AK067424

 

OsCslF9

no

 

OsCslH1

AK069071, AK060286, AK061162

all 3 are identical

OsCslH2

no

 

OsCslH3

no

 


**OsCslA12: There is an interesting potential new CSLA represented in the new full length cDNA collection. We tentatively call this OsCslA12. It is accession AK107635 and corresponds to two genomic BACs, AP006162 and AP006450. There are two T missing from the cDNA compared to AP006162 in the 5' UTR (which doesn't affect the predicted coding region); otherwise all three sequences appear identical (except of course for introns). HOWEVER, the predicted aa sequence of OsCslA12, based on the cDNA, does not align well with the known CslA and CslC proteins. It would appear that the cDNA has a frame shift, because the good alignment stops halfway through the protein (after FRK) and then picks up in a different frame (near WKK). Unfortunately, this simple explanation doesn't hold because in this region the cDNA and both genomic sequences are in 100% agreement. Either this CSL is truly drastically different than all the others in its carboxy terminus, or it is a pseudogene. Also, the gene lacks intron 6 (and maybe others; intron structure has not been completely elucidated).
____________________________________________________________________

 

 

CESA proteins. CESA1-8 are identical to Richmond’s web page. His CESA9 (partial) appears to be redundant.

 

>OsCESA1. 1076 aa.

MAANAGMVAGSRNRNEFVMIRPDGDAPPPAKPGKSVNGQVCQICGDTVGVSATGDVFVACNECAFPVCRPCYEYERKEGNQCCPQCKTRYKRHKGSPRVQGDEEEEDVDDLDNEFNYKHGNGKGPEWQIQRQGEDVDLSSSSRHEQHRIPRLTSGQQISGEIPDASPDRHSIRSGTSSYVDPSVPVPVRIVDPSKDLNSYGINSVDWQERVASWRNKQDKNMMQVANKYPEARGGDMEGTGSNGEDIQMVDDARLPLSRIVPIPSNQLNLYRIVIILRLIILMFFFQYRVTHPVRDAYGLWLVSVICEIWFALSWLLDQFPKWYPINRETYLDRLALRYDREGEPSQLAPIDVFVSTVDPLKEPPLITANTVLSILAVDYPVDKVSCYVSDDGSAMLTFEALSETAEFARKWVPFCKKHNIEPRAPEFYFAQKIDYLKDKIQPSFVKERRAMKREYEEFKVRINALVAKAQKVPEEGWTMADGTAWPGNNPRDHPGMIQVFLGHSGGLDTDGNELPRLVYVSREKRPGFQHHKKAGAMNALIRVSAVLTNGAYLLNVDCDHYFNSSKALREAMCFMMDPALGRKTCYVQFPQRFDGIDLHDRYANRNIVFFDINMKGLDGIQGPVYVGTGCCFNRQALYGYDPVLTEADLEPNIVVKSCCGGRKKKSKSYMDSKNRMMKRTESSAPIFNMEDIEEGIEGYEDERSVLMSQKRLEKRFGQSPIFIASTFMTQGGIPPSTNPASLLKEAIHVISCGYEDKTEWGKEIGWIYGSVTEDILTGFKMHARGWISIYCMPPRPCFKGSAPINLSDRLNQVLRWALGSVEILLSRHCPIWYGYNGRLKLLERLAYINTIVYPITSIPLIAYCVLPAICLLTNKFIIPEISNYAGMFFILLFASIFATGILELRWSGVGIEDWWRNEQFWVIGGTSAHLFAVFQGLLKVLAGIDTNFTVTSKASDEDGDFAELYVFKWTSLLIPPTTVLVINLVGMVAGISYAINSGYQSWGPLFGKLFFSIWVILHLYPFLKGLMGRQNRTPTIVIVWSILLASIFSLLWVKIDPFISPTQKAVALGQCGVNC

 

>OsCESA2. 1073 aa.

MDGAKSGKQCHVCQICGDGVGTAADGELFTACDVCGFPVCRPCYEYERKDGSQACPQCKTKYKRHKGSPPILGDESDDVDADDASDVNYPTSGNQDHKHKIAERMLTWRMNSGRNDDIVHSKYDSGEIGHPKYDSGEIPRIYIPSLTHSQISGEIPGASPDHMMSPVGNIGRRGHPFPYVNHSPNPSREFSGSLGNVAWKERVDGWKMKDKGAIPMANGTSIAPSEGRGVGDIDASTDYNMEDALLNDETRQPLSRKVPISSSRINPYRMVIVLRLIVLCIFLHYRITNPVRNAYPLWLLSVICEIWFALSWILDQFPKWSPINRETYLDRLALRYDREGEPSQLAPVDIFVSTVDPMKEPPLVTANTVLSILAVDYPVDKVSCYVSDDGAAMLTFDALAETSEFARKWVPFCKKYSIEPRAPEWYFAQKIDYLKDKVQASFVKDRRAMKREYEEFKVRVNALVAKAQKVPEEGWIMQDGTPWPGNNTRDHPGMIQVFLGHSGGLDTEGNELPRLVYVSREKRPGFQHHKKAGAMNALVRVSAVLTNGQYLLNLDCDHYINNSKALREAMCFLMDPNLGRRVCYVQFPQRFDGIDRNDRYANRNTVFFDINLRGLDGLQGPVYVGTGCVFNRTALYGYEPPIKQKRPGYFSSLCGGRKKTKKSKEKSTEKKKSHKHVDSSVPVFNLEDIEEGIEGSGFDDEKSLLMSQMSLEKRFGQSSVFVASTLMEYGGVPQSATPESLLKEAIHVISCGYEDKSDWGTEIGWIYGSVTEDILTGFKMHARGWRSIYCMPKRPAFKGSAPINLSDRLNQVLRWALGSVEILFSRHCPIWYGYGGRLKFLERFAYINTTIYPLTSIPLLLYCILPAICLLTGKFIIPEISNFASIWFISLFLSIFATGILEMRWSGVGIDEWWRNEQFWVIGGISAHLFAVFQGLLKVLAGIDTSFTVTSKASDEEGDFAELYMFKWTTLLIPPTTILIINLVGVVAGISYAINSGYQSWGPLFGKLFFAFWVIVHLYPFLKGLMGRQNRTPTIVVVWAILLASIFSLLWVRIDPFTTRVTGPDTQKCGINC

 

>OsCESA3. 1093 aa.

MEASAGLVAGSHNRNELVVIRRDGDPGPKPLRQQNGQVCQICGDDVGLNPDGEPFVACNECAFPVCRDCYEYERREGTQNCPQCKTRFKRLRGCARVPGDEEEDGVDDLENEFNWRDRNDSQYVAESMLHAHMSYGRGGVDVNGVPQPFQPNPNVPLLTDGQMVDDIPPEQHALVPSFMGGGGKRIHPLPYADPNLPVQPRSMDPSKDLAAYGYGSVAWKERMESWKQKQERLHQMRNDGGGKDWDGDGDDGDLPLMDEARQPLSRKVPIPSSQINPYRMVIIIRLVVLGFFFHYRVMHPVPDAFALWLISVICEIWFAMSWILDQFPKWFPIERETYLDRLTLRFDKEGQTSQLAPIDFFVSTVDPLKEPPLVTANTVLSILAVDYPVDKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFCKKYSIEPRAPEWYFQQKIDYLKDKVAPYFVRERRAMKREYEEFKVRINALVAKAQKVPEEGWTMQDGTPWPGNNVRDHPGMIQVFLGQSGGHDIEGNELPRLVYVSREKRPGYNHHKKAGAMNALVRVSAVLTNAPYMLNLDCDHYINNSKAIKEAMCFMMDPLVGKKVCYVQFPQRFDGIDRHDRYANRNVVFFDINMKGLDGIQGPIYVGTGCVFRRQALYGYDAPKTKKPPSRTCNCWPKWCICCCCFGDRKSKKKTTKPKTEKKKRSFFKRAENQSPAYALGEIEEGAPGAENEKAGIVNQQKLEKKFGQSSVFVASTLLENGGTLKSASPASLLKEAIHVISCGYEDKTDWGKEIGWIYGSVTEDILTGFKMHCHGWRSIYCIPKLPAFKGSAPLNLSDRLHQVLRWALGSVEIFFSNHCPLWYGYGGGLKCLERFSYINSIVYPFTSIPLLAYCTLPAICLLTGKFITPELTNVASLWFMSLFICIFATGILEMRWSGVGIDDWWRNEQFWVIGGVSSHLFALFQGLLKVIAGIDTSFTVTSKGGDDEEFSELYTFKWTTLLIPPTTLLLLNFIGVVAGVSNAINNGYESWGPLFGKLFFAFWVIVHLYPFLKGLVGRQNRTPTIVIVWSILLASIFSLLWVRIDPFLAKNDGPLLEECGLDCN

 

>OsCESA4. 989 aa

MMESGVPPCAACGDDAHAACRACSYALCKACLDEDAAEGRTTCARCGGEYGAPDPAHGQGAVVEEEVEESHEPAAGGVRERVTMASQLSDHQDEGVHARTMSTHARTISSVSGVGSELNDESGKPIWKNRVESWKEKKKEKKASAKKAAAKAQAPPVEEQIMDEKDLTDAYEPLSRIIPISKNKLTPYRAVIIMRLVVLGLFFHYRITNPVYSAFGLWMTSVICEIWFGFSWILDQFPKWCPINRETYVDRLIARYGDGEDSGLAPVDFFVSTVDPLKEPPLITANTVLSILAVDYPVEKISCYVSDDGSAMLTFESLAETAEFARRWVPFCKKYSIEPRAPEFYFSQKIDYLKDKIHPSFVKERRAMKRDYEEYKVRINALVAKAQKTPEEGWIMQDGTPWPGNNPRDHPGMIQVFLGETGARDFDGNELPRLVYVSREKRPGYQHHKKAGAMNALVRVSAVLTNAPYILNLDCDHYVNNSKAVREAMCFMMDPSVGRDVCYVQFPQRFDGIDRSDRYANRNVVFFDVNMKGLDGLQGPVYVGTGCCFYRQALYGYGPPSLPALPKSSVCSWCCCCCPKKKAEKSEKEMHRDSRREDLESAIFNLREIDNYDEYERSMLISQMSFEKSFGLSSVFIESTLMENGGVPESANPSTLIKEAIHVISCGYEEKTEWGKEIGWIYGSVTEDILTGFKMHCRGWRSIYCMPIRPAFKGSAPINLSDRLHQVLRWALGSVEIFLSRHCPLWYGYGGGRLKWLQRLSYINTIVYPFTSLPLIAYCCLPAICLLTGKFIIPTLSNAATIWFLGLFISIIVTSVLELRWSGIGIEDWWRNEQFWVIGGVSAHLFAVFQGILKMIAGLDTNFTVTAKATDDTEFGELYVFKWTTVLIPPTSILVLNLVGVVAGFSDALNSGYESWGPLFGKVFFAMWVIMHLYPFLKGLMGRQNRTPTIVVLWSVLLASVFSLLWVKIDPFIGSSETTTTNSCANFDC

 

 

>OsCESA5. 1092 aa.

MEASAGLVAGSHNRNELVVIRRDGEPGPKPVKHTNGQVCQICGDDVGLTPDGEPFVACNECAFPVCRDCYEYERREGTQNCPQCKTRFKRLKGCARVPGDEEEEDVDDLENEFNWRDKTDSQYVAESMLHGHMSYGRGGDLDGVPQHFQPIPNVPLLTNGEMADDIPPEQHALVPSFMGGGGKRIHPLPYADPNLPVQPRSMDPSKDLAAYGYGSVAWKERMESWKQKQERLHQMRNDGGGKDWDGDGDDADLPLMDEARQPLSRKIPISSSLVNPYRMIIIIRLVVLGFFFHYRVMHPVPDAFALWLISVICEIWFAMSWILDQFPKWFPIERETYLDRLTLRFDKEGQQSQLAPVDFFVSTVDPMKEPPLVTANTVLSILAVDYPVDKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFCKRYSLEPRAPEWYFQQKIDYLKDKVAPNFVRERRAMKREYEEFKVRINALVAKAQKVPEEGWTMQDGTPWPGNNVRDHPGMIQVFLGQSGGHDVEGNELPRLVYVSREKRPGYNHHKKAGAMNALVRVSAVLTNAPYMLNLDCDHYINNSKAIKEAMCFMMDPLVGKKVCYVQFPQRFDGIDRHDRYANRNVVFFDINMKGLDGIQGPIYVGTGCVFRRQALYGYDAPKSKKPPSRTCNCWPKWCICCCCFGNRTNKKKTAKPKTEKKKRLFFKRAENQSPAYALGEIDEGAPGAENEKAGIVNQQKLEKKFGQSSVFVASTLLENGGTLKSASPASLLKEAIHVISCGYEDKTDWGKEIGWIYGSVTEDILTGFKMHCHGWRSIYCIPKRAAFKGSAPLNLSDRLHQVLRWALGSIEIFFSNHCPLWYGYGGGLKCLERFSYINSIVYPWTSIPLLAYCTLPAICLLTGKFITPELTNIASLWFMSLFICIFATGILEMRWSGVGIDDWWRNEQFWVIGGVSSHLFAVFQGLLKVIAGIDTSFTVTSKGGDDEEFSELYTFKWTTLLIPPTTLLLLNFIGVVAGVSNAINNGYESWGPLFGKLFFAFWVIVHLYPFLKGLVGRQNRTPTIVIVWSILLASIFSLLWVRIDPFLAKNDGPLLEECGLDCN

 

>OsCESA6. 1092 aa.

MEASAGLVAGSHNRNELVVIRRDGGGGGGVGGRRATEAKTACQICGDDVGEGPDGEPFVACNECAFPVCRNCYDYERREGSQACPQCKTRFKRLKGCPRVAGDEEEDGVDDLEGEFGLDGREDDPQYIAESMLRANMSYGRGGDLQPFQPIPNVPLLTNGQMVDDIPPEQHALVPSYMGGGGGGGKRIHPLPFADPSVPVQPRSMDPSKDLAAYGYGSVAWKERMEGWKQKQERMQQLRSEGGGDWDGDGDADLPLMDEARQPLSRKVPISSSRINPYRMIIIIRLVVLGFFFHYRVMHPVNDAFALWLISVICEIWFAMSWILDQFPKWLPIERETYLDRLSLRFDKEGQPSQLAPVDFFVSTVDPSKEPPLVTANTVLSILSVDYPVEKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFCKKFNIEPRAPEWYFQQKIDYLKDKVAASFVRERRAMKRDYEEFKVRINALVAKAQKVPEEGWTMQDGSPWPGNNVRDHPGMIQVFLGQSGGRDVEGNELPRLVYVSREKRPGYNHHKKAGAMNALVRVSAVLSNAPYLLNLDCDHYINNSKAIREAMCFMMDPLVGKKVCYVQFPQRFDGIDRHDRYANRNVVFFDINMKGLDGIQGPIYVGTGCVFRRQALYGYDAPKTKKPPSRTCNCWPKWCCCCCCGNRHTKKKTTKPKPEKKKRLFFKKAENQSPAYALGEIEEGAPGAETDKAGIVNQQKLEKKFGQSSVFVASTLLENGGTLKSASPASLLKEAIHVISCGYEDKTDWGKEIGWIYGSITEDILTGFKMHCHGWRSIYCIPKRPAFKGSAPLNLSDRLHQVLRWALGSVEIFFSKHCPLWYGYGGGLKFLERFSYINSIVYPWTSIPLLAYCTLPAICLLTGKFITPELTNVASLWFMSLFICIFVTGILEMRWSGVAIDDWWRNEQFWVIGGVSSHLFAVFQGLLKVLAGVDTSFTVTSKAGDDEEFSELYTFKWTTLLIPPTTLLLLNFIGVVAGVSNAINNGYESWGPLFGKLFFAFWVIVHLYPFLKGLVGRQNRTPTIVIVWSILLASIFSLLWVRIDPFLAKNNGPLLEECGLDCN

 

>OsCESA7. 1063 aa

MDTASVTGGEHKGKEKTCRVCGEEVAAREDGKPFVACAECGFPVCKPCYEYERSEGTQCCPQCNTRYKRHKGCPRVEGDEDDGGDMDDFEEEFQIKSPTKQKPPHEPVNFDVYSENGEQPAQKWRPGGPALSSFTGSVAGKDLEQEREMEGGMEWKDRIDKWKTKQEKRGKLNRDDSDDDDDKNDDEYMLLAEARQPLWRKVPIPSSKINPYRIVIVLRLVVLCFFLKFRITTPAMDAVPLWLASVICELWFALSWILDQLPKWSPVTRETYLDRLALRYERDGEPCRLAPIDFFVSTVDPLKEPPIITANTVLSILAVDYPVDRVSCYVSDDGASMLLFDTLSETAEFARRWVPFCKKFTIEPRAPEFYFSQKIDYLKDKVQPTFVKERRAMKREYEEFKVRINALVAKAQKKPEEGWVMQDGTPWPGNNTRDHPGMIQVYLGSQGALDVEGSELPRLVYVSREKRPGYNHHKKAGAMNSLVRVSAVLTNAPFILNLDCDHYVNNSKAVREAMCFLMDKQLGKKLCYVQFPQRFDGIDRHDRYANRNTVFFDINMKGLDGIQGPVYVGTGTVFNRQALYGYDPPRPEKRPKMTCDCWPSWCCCCCCFGGGKRGKSHKNKKGGGGGEGGGLDEPRRGLLGFYKKRSKKDKLGGGAASLAGGKKGYRKHQRGFELEEIEEGLEGYDELERSSLMSQKSFEKRFGQSPVFIASTLVEDGGLPQGAAADPAALIKEAIHVISCGYEEKTEWGKEIGWIYGSVTEDILTGFKMHCRGWKSVYCTPARAAFKGSAPINLSDRLHQVLRWALGSVEIFMSRHCPLWYAYGGRLKWLERFAYTNTIVYPFTSIPLLAYCTIPAVCLLTGKFIIPTLNNLASIWFIALFLSIIATGVLELRWSGVSIEDWWRNEQFWVIGGVSAHLFAVFQGLLKVLGGVDTNFTVTSKAAADETDAFGELYLFKWTTLLVPPTTLIIINMVGIVAGVSDAVNNGYGSWGPLFGKLFFSFWVILHLYPFLKGLMGRQNRTPTIVVLWSILLASIFSLVWVRIDPFIPKPKGPVLKPCGVSC

 

>OsCESA8. 1081 aa.

MDGDADAVKSGRHGSGQACQICGDGVGTTAEGDVFAACDVCGFPVCRPCYEYERKDGTQACPQCKTKYKRHKGSPAIRGEEGEDTDADDVSDYNYPASGSADQKQKIADRMRSWRMNAGGGGDVGRPKYDSGEIGLTKYDSGEIPRGYIPSVTNSQISGEIPGASPDHHMMSPTGNIGKRAPFPYVNHSPNPSREFSGSIGNVAWKERVDGWKLKQDKGAIPMTNGTSIAPSEGRGVGDIDASTDYNMEDALLNDETRQPLSRKVPLPSSRINPYRMVIVLRLVVLSIFLHYRITNPVRNAYPLWLLSVICEIWFALSWILDQFPKWFPINRETYLDRLALRYDREGEPSQLAAVDIFVSTVDPMKEPPLVTANTVLSILAVDYPVDKVSCYVSDDGAAMLTFDALAETSEFARKWVPFVKKYNIEPRAPEWYFSQKIDYLKDKVHPSFVKDRRAMKREYEEFKVRINGLVAKAQKVPEEGWIMQDGTPWPGNNTRDHPGMIQVFLGHSGGLDTEGNELPRLVYVSREKRPGFQHHKKAGAMNALVRVSAVLTNGQYMLNLDCDHYINNSKALREAMCFLMDPNLGRSVCYVQFPQRFDGIDRNDRYANRNTVFFDINLRGLDGIQGPVYVGTGCVFNRTALYGYEPPIKQKKKGSFLSSLCGGRKKASKSKKKSSDKKKSNKHVDSAVPVFNLEDIEEGVEGAGFDDEKSLLMSQMSLEKRFGQSAAFVASTLMEYGGVPQSATPESLLKEAIHVISCGYEDKTEWGTEIGWIYGSVTEDILTGFKMHARGWRSIYCMPKRPAFKGSAPINLSDRLNQVLRWALGSVEILFSRHCPIWYGYGGRLKFLERFAYINTTIYPLTSIPLLIYCVLPAICLLTGKFIIPEISNFASIWFISLFISIFATGILEMRWSGVGIDEWWRNEQFWVIGGISAHLFAVFQGLLKVLAGIDTNFTVTSKASDEDGDFAELYMFKWTTLLIPPTTILIINLVGVVAGISYAINSGYQSWGPLFGKLFFAFWVIVHLYPFLKGLMGRQNRTPTIVVVWAILLASIFSLLWVRIDPFTTRVTGPDTQTCGINC
 

>OsCESA9. 1055 aa.

MEASAGLVAGSHNRNELVLIRGHEEPKPLRALSGQVCEICGDEVGRTVDGDLFVACNECGFPVCRPCYEYERREGTQNCPQCKTRYKRLKGSPRVPGDEDEEDIDDLEHEFNIDDEKQKQLQQDQDGMQNSHITEAMLHGKMSYGRGPDDGDGNSTPLPPIITGARSVPVSGEFPISNSHGHGEFSSSLHKRIHPYPVSEPGSAKWDEKKEVSWKERMDDWKSKQGIVAGGAPDPDDYDADVPLNDEARQPLSRKVSIASSKVNPYRMVIILRLVVLGFFLRYRILHPVPDAIPLWLTSIICEIWFAVSWILDQFPKWYPIDRETYLDRLSLRYEREGEPSLLSAVDLFVSTVDPLKEPPLVTANTVLSILAVDYPVDKVSCYVSDDGASMLTFESLSETAEFARKWVPFCKKFSIEPRAPEFYFSQKVDYLKDKVHPNFVQERRAMKREYEEFKVRINALVAKAQKVPAEGWIMKDGTPWPGNNTRDHPGMIQVFLGHSGGHDTEGNELPRLVYVSREKRPGFQHHKKAGAMNALIRVSAVLTNAPFMLNLDCDHYINNSKAIREAMCFLMDPQVGRKVCYVQFPQRFDGIDVHDRYANRNTVFFDINMKGLDGIQGPVYVGTGCVFRRQALYGYNPPKGPKRPKMVTCDCCPCFGRKKRKHGKDGLPEAVAADGGMDSDKEMLMSQMNFEKRFGQSAAFVTSTLMEEGGVPPSSSPAALLKEAIHVISCGYEDKTDWGLELGWIYGSITEDILTGFKMHCRGWRSVYCMPKRAAFKGSAPINLSDRLNQVLRWALGSVEIFFSRHSPLLYGYKNGNLKWLERFSYINTTIYPFTSLPLLAYCTLPAVCLLTGKFIMPPISTFASLFFIALFISIFATGILEMRWSGVSIEEWWRNEQFWVIGGVSAHLFAVVQGLLKVLAGIDTNFTVTSKATGDEDDEFAELYAFKWTTLLIPPTTLLILNIIGVVAGVSDAINNGSEAWGPLFGKLFFAFWVIVHLYPFLKGLMGRQNRTPTIVVIWSVLLASIFSLLWVRIDPFTIKARGPDVRQCGINC

 

>possible OsCESA10. Os12g29300. Protein 11982.m06744. 244 amino acids. Either sequencing isn’t complete (BAC AL731763 has no annotation) or it’s a pseudogene. 

MDVFVTTADPDGIAALDDDALLPAMDVFVTTADPDKEPPLATANTVLSIYPRRGLPRRQVVQVLIDSAGSVPQLGVADGSKLIDVASVDVCLPALVYVCREKRRGHAHHRKAGAMNAPFILDLDCDHYVNNSQALRAGICFMIERGGGGAAEDAVAVAFVQFPQRVDGVDPSDRYANHNRVFFDCTELGLDGLQGPIYVGTGCLFRRVALYSVDLPRWRPRRSLGCRLLGEDERLWSRLKQMVI

 

>possible OsCESA11. AP003612. 860 aa. Os06g39970. No ESTs. Is shorter due to truncated N terminus compared to other CESA proteins. Can’t predict a better protein with any FGENESH model.
 

MDGESPEIMPVECPDPEPASSESGDDHDIPEPLSSRLSVPSGELNLYRAAVALRLVLLAAFFRYRVTRPVADAHALWVTSVACELWLAASWLIAQLPKLSPANRVTYLDRLASRYEKGGEASRLAGVDVFVAAADAAREPPLATANTVLSVLAADYPAGGVACYVHDDGADMLVFESLFEAAGFARRWIPFCRRHGVEPRAPELYFARGVDYLRDRAAPSFVKDRRAMKREYEEFKVRMNHLAARARKVPEEGWIMSDGTPWPGNNSRDHPAMIQVLLGHPGDRDVDGGELPRLFYVSREKRPGFRHHGKAGAMNALLRVSAVLTNGAYVLNLDCDHCVNNSSALREAMCFMMDPVAGNRTCFVQFALRDSGGGDSVFFDIEMKCLDGIQGPVYVGSGCCFSRKALYGFEPAAAADDGDDMDTAADWRRMCCFGRGKRMNAMRRSMSAVPLLDSEDDSDEQEEEEAAGRRRRLRAYRAALERHFGQSPAFIASAFEEQGRRRGGDGGSPDATVAPARSLLKEAIHVVSCAFEERTRWGKEIGWMYGGGVATGFRMHARGWSSAYCSPARPAFRRYARASPADVLAGASRRAVAAMGILLSRRHSPVWAGRRLGLLQRLGYVARASYPLASLPLTVYCALPAVCLLTGKSTFPSDVSYYDGVLLILLLFSVAASVALELRWSRVPLRAWWRDEKLWMVTATSASLAAVFQGILSACTGIDVAFSTETAASPPKRPAAGNDDGEEEAALASEITMRWTNLLVAPTSVVVANLAGVVAAVAYGVDHGYYQSWGALGAKLALAGWVVAHLQGFLRGLLAPRDRAPPTIAVLWSVVFVSVASLLWVHAASFSAPTAAPTTEQPIL

 

 

 

TIGR Rice Community Annotation

 

 

10-Mar-06

 

 

 

 

 

 

 

 

Name:

Jonathan Walton

 

Gene Family:

Cellulose synthase (CES) and Cellulose synthase-like (CSL)

 

 

 

Org:

Michigan State University

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Locus

Gene Name

Gene Description

Alt Gene Name (from Genbank or TIGR)

GenBank Acc Genomic

GenBank Acc cDNA

GenBank Acc Protein

Mutant

Criteria

Source

Comment

Os05g08370

CESA1

cellulose synthase

CesA subunit 1

AC135426

AK100188

AAU44296

 

 

 

 

Os03g59340

CESA2

cellulose synthase

CesA subunit 3

AC135958

AK069196

AAP21426

 

 

 

 

Os07g24190

CESA3

cellulose synthase

CesA subunit 10

AP005248, AP004298

AK073561

BAD30574

 

 

 

 

Os01g54620

CESA4

cellulose synthase

CesA subunit 8

AP003237

AK100475

BAD87094

yes

TOS17

Tanaka et al. (2003) Plant Physiol. 133:73

 

Os03g62090

CESA5

cellulose synthase

CesA subunit 6

AC104487

AK100877

AAO41140

 

 

 

 

Os07g14850

CESA6

cellulose synthase

CesA subunit 6; putative cellulose synthase-8

AP005824

AK100914

BAC84511

 

 

 

 

Os10g32980

CESA7

cellulose synthase

CesA subunit 4

AC022457

AK072259

AAK27814

yes

TOS17

Tanaka et al. (2003) Plant Physiol. 133:73

 

Os07g10770

CESA8

cellulose synthase

CesA  subunit 4; cellulose synthase-4

AP003837

AK072356

BAC57282

 

 

 

 

Os09g25490

CESA9

cellulose synthase

CesA subunit 8

AP005579

AK121170

BAD33645

yes

TOS17

Tanaka et al. (2003) Plant Physiol. 133:73

 

Os12g29300

possible CesA10

cellulose synthase

CesA subunit 6

AL731763

none

none (the BAC is not annotated)

 

 

 

No ESTs. Protein only 244 aa. Pseudogene?

Os06g39970

possible CesA11

cellulose synthase

putative cellulose synthase-3

AP003612

none

BAD32845 

 

 

 

No ESTs. Protein only 860 aa. Pseudogene?

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Os02g09930

CSLA1

cellulose synthase-like family A; mannan synthase

putative cellulose synthase-like protein OsCslA9

AP005785

AK102694

BAD34025

 

 

 

TIGR says this gene has alternative splicing.

Os10g26630

CSLA2

cellulose synthase-like family A; mannan synthase

glycosyl transferase family 2

AC021893

none

AAK98678

 

 

 

TIGR model is wrong.

Os06g12460

CSLA3

cellulose synthase-like family A; mannan synthase

glycosyltransferase 5

AP003509

none

BAD37274

 

 

 

 

Os03g07350

CSLA4

cellulose synthase-like family A; mannan synthase

glycosyl transferase family 2 protein

AC073556

none

AAL84294

 

 

 

OsCslA4 has an atypical intron border start of GC instead of GT near the 3’ end (according to TIGR).

Os03g26044

CSLA5

cellulose synthase-like family A; mannan synthase

glycosyltransferase

AC084766

AK110517, AK111424

AAL82530

 

 

 

 

Os02g51060

CSLA6

cellulose synthase-like family A; mannan synthase

glycosyltransferase 10

AP005297

AK058756

BAD16122, AAL25127

 

 

 

Genbank is wrong by adding 3 extra amino acids at an intron jxn. TA11428_4530 from TIGR confirms that the 3 aa are wrong.

Os07g43710

CSLA7

cellulose synthase-like family A; mannan synthase

glycosyltransferase 5

AP004260

AK122106   AK064833

BAC79726, AAL38528, XP_479231

 

 

 

 

Os06g42020

CSLA9

cellulose synthase-like family A

glycosyltransferase 5,  glycosyltransferase 1

AP008212  AP004737

none

BAD37742

 

 

 

 

Os08g33740

CSLA11

cellulose synthase-like family A

Glycosyltransferase

AP004666, AP005757

none

BAD09847                

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Os01g56130

CSLC1

cellulose synthase-like family C

glycosyl transferase family 2, putative CSLC9

AP003377

AK110759

BAC10759

 

 

 

 

Os09g25900

CSLC2

cellulose synthase-like family C

glycosyl transferase family 2

AP005568

none

BAD33623 (wrong)

 

 

 

TA14765_4530; TIGR model is correct (698 aa); Genbank model is wrong

Os08g15420

CSLC3

cellulose synthase-like family C

glycosyl transferase family 2

AP004013

AK108045

 

 

 

 

 

Os05g43530

CSLC7

cellulose synthase-like family C

glycosyl transferase family 2; putative glucosyltransferase

AC108873

none

AAT44138 

 

 

 

 

Os03g56060

CSLC9

cellulose synthase-like family C

glycosyl transferase family 2

AC133450

AK121805

AAT85054 (has AAA); AF435641 (has AA)

 

 

 

FL cDNA and some ESTs don't agree. 595 vs 596 aa

Os07g03260

CSLC10

cellulose synthase-like family C

glycosyl transferase family 2; putative CSLC9(cellulose synthase-like)

AP005309 

none

BAC56816

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Os10g42750

CSLD1

cellulose synthase-like family D

cellulose synthase-like protein D4; putative cellulose synthase

AC027037

AK110534

AAL58185

 

 

 

 

Os06g02180

CSLD2

cellulose synthase-like family D

cellulose synthase-like protein D4

AP001552

AK105393 AK102134 AK102695

BAA93027

 

 

 

 

Os08g25710

CSLD3

cellulose synthase-like family D

cellulose synthase D

AP004459

none

BAD01697 DAA01756

 

 

 

 

Os12g36890

CSLD4

cellulose synthase-like family D

cellulose synthase family protein

AL845342

none

 ABA99552

 

 

 

 

Os06g22980

CSLD5

cellulose synthase-like family D

cellulose synthase family protein; cellulose synthase-like protein D4

AP005449

AK072260   

BAD61907

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Os09g30120

CSLE1

cellulose synthase-like family E

cellulose synthase-like protein OsCslE1

AP005759

AK102766

BAD46389  (right); AAL25129 (wrong); BAD46390 (truncated - alternative splice?)

 

 

 

TIGR says has alternative splicing. TIGR map shows overlap with 30100, which seems unlikely.

Os02g49332

CSLE2

cellulose synthase-like family E

cellulose synthase-like protein CslE

AP005113; AP004179

AK101487

BAD13046 BAD13047 (both wrong); AAL25130 (right); BAD12921  (wrong); BAD12922 (wrong); BAD12923 (wrong)

 

 

 

TIGR: 2 alternative splice forms

Os09g30130

CSLE6

cellulose synthase-like family E

Cellulose synthase family

AP005759

AK068464

BAD46391

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Os07g36700

CSLF1

cellulose synthase-like family F; beta1,3;1,4 glucan synthase

Cellulose synthase family

AP004261

none

 

 

 

 

 

Os07g36690

CSLF2

cellulose synthase-like family F; beta1,3;1,4 glucan synthase

cellulose synthase-like protein OsCslF1

AP004261  AP005126 

AK100523, AK066835

BAD30521 BAC65378

 

 

 

 

Os07g36750

CSLF3

cellulose synthase-like family F; beta1,3;1,4 glucan synthase

Cellulose synthase family

AP004261

none

BAC83322

 

 

 

 

Os07g36740

CSLF4

cellulose synthase-like family F; beta1,3;1,4 glucan synthase

Cellulose synthase family

AP004261

none

BAC83321 

 

 

 

 

Os08g06380

CSLF6

cellulose synthase-like family F; beta1,3;1,4 glucan synthase

Cellulose synthase family; putative cellulose synthase-5

AP004635

AK065259

BAC66734

 

 

 

 

Os10g20260

CSLF7

cellulose synthase-like family F; beta1,3;1,4 glucan synthase

Cellulose synthase family; cellulose synthase D-like

AC090441 AP008216

AK110467

AAK91320

 

 

 

 

Os07g36630

CSLF8

cellulose synthase-like family F; beta1,3;1,4 glucan synthase

Cellulose synthase family

AP005126

AK067424

BAC65371

 

 

 

 

Os07g36610

CSLF9

cellulose synthase-like family F; beta1,3;1,4 glucan synthase

Cellulose synthase family; cellulose synthase-like protein
OsCslF1

AP005126

none

BAC80027 (correct)

 

 

 

TIGR model is wrong.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Os10g20090

CSLH1

cellulose synthase-like family H

Cellulose synthase family; putative cellulose
synthase

AC119148

AK069071; AK060286;  AK061162;  AK121003 (none are FL)

AAN01252 (wrong); ABB47240 (right); DAA01747 (right); ABB47241 (truncated)

 

 

 

 

Os04g35020

CSLH2

cellulose synthase-like family H

Cellulose synthase family

AL6066