Xavier Morin, Developmental Biology Institute of Marseilles, and William Chia, King's College London
Personal communication to the Bloomington Drosophila Stock Center, 21 December 2002
Sequences of protein trap constructs
described in:
Morin, X., Daneman, R., Zavortink, M., Chia, W. A protein trap strategy to
detect GFP-tagged proteins expressed from their endogenous loci in
Drosophila. Proc. Natl. Acad. Sci. USA 2001 98(26):15050-15055 (FBrf0141725).
Color Key
- purple capital letters: the EcoRI and PstI sites
- yellow letters in black boxes: the splice sites
- underlined capital letters: the 6-HIS tag, which contains an NcoI site (blue italic underlined capital letters)
- green: the GFP sequence, ending with a SalI site (blue italic capital letters) and an XhoI site (red capital letters)
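Because the colors of the key may not survive in a plain-text copy, the sites it names can also be located by their recognition sequences. The following is a minimal sketch, assuming the standard recognition sequences for these five enzymes; the function name `find_sites` and the two short test strings (junctions copied from the construct sequence in this record) are illustrative choices, not part of the original communication.

```python
# Standard recognition sequences of the enzymes named in the color key.
SITES = {
    "EcoRI": "GAATTC",
    "PstI": "CTGCAG",
    "NcoI": "CCATGG",
    "SalI": "GTCGAC",
    "XhoI": "CTCGAG",
}

def find_sites(seq, sites=SITES):
    """Return {enzyme: [1-based start positions]} for every site found in seq."""
    seq = seq.upper()
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start + 1)
            start = seq.find(motif, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits

# Junctions copied from the construct sequence in this record:
# the 6-HIS tag followed by the GGg linker codon, and the last EGFP
# codons followed by the 3' linker.
his_junction = "CACCATCATCACCACCATGGg"
gfp_end_junction = "ctgtacaaGTCGACTCGAG"

print(find_sites(his_junction))
print(find_sites(gfp_end_junction))
```

In these two fragments the NcoI site spans the last His codon and the GGG linker codon, and the SalI and XhoI sites overlap by one base.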
LOCUS 5956 bp DNA SYN
DEFINITION P{GA/B/C} 5956 bp
ACCESSION -
KEYWORDS -
SOURCE
ORGANISM
COMMENT
FEATURES Location/Qualifiers
source 1..587
/note="segment=Car1_P5'(P5',R1) (GB/M30841 1..587);
segment_size=587"
source 588..601
/note="segment=J 001069;
segment_size=14"
source 602..3892
/note="segment=w+3.3(Xb,Sp) (GB/U59054 931..4221);
segment_size=3291"
source 3893..3906
/note="segment=J 001078;
segment_size=14"
source 3907..4728
/note="segment=w+0.8(Sp,H3) (GB/X02974 7204..8025);
segment_size=822"
source 4729..4732
/note="segment=J 001068;
segment_size=4"
source 4733..4738
/note="segment=EcoRI restriction site;
segment_size=6"
source 4739..4838
/note="segment=splice acceptor from Drosophila mhc gene;
segment_size=100"
source 4839..4843
/note="segment=linker;
segment_size=5(variable length between GA, GB and GC)"
source 4844..4861
/note="segment=6HIS tag;
segment_size=18"
source 4862..4864
/note="segment=linker: GGG codon replaces ATG from EGFP;
segment_size=3"
source 4865..5578
/note="segment=EGFP sequence without start and stop codons;
segment_size=714"
source 5579..5588
/note="segment=linker (contains SalI and XhoI restriction sites);
segment_size=10(variable length between GA, GB and GC)"
source 5589..5709
/note="segment=splice donor from Drosophila mhc gene;
segment_size=121"
source 5710..5723
/note="segment=linker containing the PstI site of Casper4;
segment_size=14"
source 5724..5956
/note="segment=Car1_P3'(H3,P3') (GB/M30841 624..856);
segment_size=233"
BASE COUNT 0 1487 a 1113 c 1210 g 1293 t
ORIGIN
1 catgatgaaa taacataagg tggtcccgtc gatagccgaa gcttaccgaa gtatacactt
61 aaattcagtg cacgtttgct tgttgagagg aaaggttgtg tgcggacgaa tttttttttg
121 aaaacattaa cccttacgtg gaataaaaaa aaatgaaata ttgcaaattt tgctgcaaag
181 ctgtgactgg agtaaaatta attcacgtgc cgaagtgtgc tattaagaga aaattgtggg
241 agcagagcct tgggtgcagc cttggtgaaa actcccaaat ttgtgatacc cactttaatg
301 attcgcagtg gaaggctgca cctgcaaaag gtcagacatt taaaaggagg cgactcaacg
361 cagatgccgt acctagtaaa gtgatagagc ctgaaccaga aaagataaaa gaaggctata
421 ccagtgggag tacacaaaca gagtaagttt gaatagtaaa aaaaatcatt tatgtaaaca
481 ataacgtgac tgtgcgttag gtcctgttca ttgtttaatg aaaataagag cttgagggaa
541 aaaattcgta ctttggagta cgaaatgcgt cgtttagagc agcagccgaa ttaattctag
601 ttccagtgaa atccaagcat tttctaaatt aaatgtattc ttattattat agttgttatt
661 tttgatatat ataaacaaca ctattatgcc caccattttt ttgagatgca tctacacaag
721 gaacaaacac tggatgtcac tttcagttca aattgtaacg ctaatcactc cgaacaggtc
781 acaaaaaatt accttaaaaa gtcataatat taaattagaa taaatatagc tgtgagggaa
841 atatatacaa atatattgga gcaaataaat tgtacataca aatatttatt actaatttct
901 attgagacga aatgaaccac tcggaaccat ttgagcgaac cgaatcgcgc ggaactaacg
961 acagtcgctc caaggtcgtc gaacaaaagg tgaatgtgtt gcggagagcg ggtgggagac
1021 agcgaaagag caactacgaa acgtggtgtg gtggaggtga attatgaaga gggcgcgcga
1081 tttgaaaagt atgtatataa aaaatatatc ccggtgtttt atgtagcgat aaacgagttt
1141 ttgatgtaag gtatgcaggt gtgtaagtct tttggttaga agacaaatcc aaagtctact
1201 tgtggggatg ttcgaagggg aaatacttgt attctatagg tcatatcttg tttttattgg
1261 cacaaatata attacattag ctttttgagg gggcaataaa cagtaaacac gatggtaata
1321 atggtaaaaa aaaaaacaag cagttatttc ggatatatgt cggctactcc ttgcgtcggg
1381 cccgaagtct tagagccaga tatgcgagca cccggaagct cacgatgaga atggccagac
1441 ccacgtagtc cagcggcaga tcggcggcgg agaagttaag cgtctccagg atgaccttgc
1501 ccgaactggg gcacgtggtg ttcgacgatg tgcagctaat ttcgcccggc tccacgtccg
1561 cccattggtt aatcagcaga ccctcgttgg cgtaacggaa ccatgagagg tacgacaacc
1621 atttgaggta tactggcacc gagcccgagt tcaagaagaa gccgccaaag agcaggaatg
1681 gtatgataac cggcggaccc acagacagcg ccatcgaggt cgaggagctg gcgcaggata
1741 ttagatatcc gaaggacgtt gacacattgg ccaccagagt gaccagcgcc aggcagttga
1801 agaagtgcag cactccggcc cgcagtccga tcatcggata ggcaatcgcc gtgaagacca
1861 gtggcactgt gagaaaaagc ggtaattcgg caatcgtttt gcccagaaag tatgtgtcac
1921 agcgataaag tcgacttcgg gcctccctca taaaaactgg cagctctgag gtgaacacct
1981 aaatcgaatc gattcattag aaagttagta aattattaat atgcaaatgt attctaaaca
2041 agacttacat ttatcgtggc aaagacgttt tgaaaggtca tgttggtcag gaagaggaag
2101 atggctccgt tgatattcat cacgcccact tgcgtgagtt gttggcccaa aaagatgagg
2161 ccaatcaaga tggcaaccat ctgcaaatta aaatgttact cgcatctcat taatattcat
2221 atcttcaaca tgttcgcgag ttaaatgaaa tttatttatt ttctgcaaaa ctataaacta
2281 tacatctcat tgaaaaaaac taagaagggt gtggaatcag gcaattctaa ctaaaatcta
2341 gcgaatttgt ttccaagaat tgtaagcgtt atatcatttg tttccactgg aaccactcac
2401 cgttgtctga ataagtcgca cttttacgag gagtggttcc ttgagcaccg acagccagga
2461 tcgccacagg accgcccgga actgcatgaa ccaggtggcc ttgtaggtgt acccattctc
2521 cggctgctcc agtggcttct ccaaattttt ggtggccaac aactgctcca tatcccgggc
2581 tactttgcta atagcaaaat tgtcgcatat cttggcgatc cgatcacggg actcgatctc
2641 ccgtccgggc acaacggcca acacctgtac gtaaaagtcc gccggattgt agttggtagg
2701 acactgggca cccacgctgg ataggagttg agatgtaatg taatgctaga tacccttaat
2761 aaacacatcg aactcactag gaaaagaagt cgacggcttc gctgggagtg cccaagaaag
2821 ctaccctgcc ctcggccatc agaaggatct tgtcaaagag ctcaaacagc tcggaagacg
2881 gctgatgaat ggtcaggatg acggtcttgc ccttctgcga cagcttcttc agcacctgga
2941 cgacgctgtg ggcggtaaat gagtccagtc cggaggtggg ctcatcgcag atcagaagcg
3001 gcggatcggt tagtgcctcg gaggcgaatg ccagacgctt cctttctccg ccggacagac
3061 ctttcaccct gccgggcaca ccgatgatcg tgtgctgaca tttgctgagc gaaagctcct
3121 ggatcacctg atccacgcgg gccactcgct gccgataggt cagatgtcgt ggcatccgca
3181 ccatggcctg gaaaatcagg tgttccctgg ccgttaggga gccgataaag aggtcatcct
3241 gctggacata ggcgcacctg gcctgcatct ccttggcgtc cacaggttgg ccattgagca
3301 gtcgcatccc ggatggcgat acttggatgc cctgcggcga tcgaaaggca agggcattca
3361 gcagggtcgt ctttccggca ccggaactgc ccatcacggc caaaagttcg cccggatagg
3421 ccacgccgca aactgagttt caaattggta attggaccct ttattaagat ttcacacaga
3481 tcagccgact gcgaatagaa actcaccgtt cttgagcaaa tgtttcctgg gcgccggtat
3541 gtgtcgctcg ttgcagaata gtccgcgtgt ccggttgacc agctgccgcc atccggagcc
3601 cggctgattg accgccccaa agatgtccat attgtgccag gcataggtga ggttctcggc
3661 tagttggccg ctccctgaac cggagtcctc cggcggactg ggtggcagga gcgtgccgta
3721 gtttttggcc tgcccgaagc cctggttaat gcagctctgc gaagccgctc cgctgtcacc
3781 ctgcaatgat aggggatctc aaatatcaac tacaagcgtt atgctcatct aaccccgaac
3841 aaaacgaagt atcctacgaa gtaggtttat acttttattt attttttgtg catctaggat
3901 cagcttaaaa tatctggttg ttatattttt tgtaaaaaag aatgtagtcg aaaatgaatg
3961 cctttagatg tcttgatcat gatatgatct taaaaattgt cttatatagc gagcacagct
4021 accagaataa tctgtttcgt gtcactattt gtttgtgcga ttgcggtttg ggatttttgt
4081 gggtcgcagt tctcacgccg cagacaattt gatgttgcaa tcgcagttcc tatagatcaa
4141 gtgaacttaa gatgtatgca catgtactac tcacattgtt cagatgctcg gcagatgggt
4201 gtttgctgcc tccgcgaatt aatagctcct gatcctcttg gcccattgcc gggatttttc
4261 acactttccc ctgcttaccc acccaaaacc aatcaccacc ccaatcactc aaaaaacaaa
4321 caaaaataag aagcgagagg agttttggca cagcactttg tgtttaattg atggcgtaaa
4381 ccgcttggag cttcgtcacg aaaccgctga caaagtgcaa ctgaaggcgg acattgacgc
4441 taggtaacgc tacaaacggt ggcgaaagag atagcggacg cagcggcgaa agagacggcg
4501 atatttctgt ggacagagaa ggaggcaaac agcgctgact ttgagtggaa tgtcattttg
4561 agtgagaggt aatcgaaaga acctggtact tcaaataccc ttggatcgaa gtaaatttaa
4621 aactgatcag ataagttcaa tgatgtccag tgcagtaaaa aataaaaaaa aaatatgttt
4681 ttttaaatct acattctcca aaaaagggtt ttattaactt acatacatac taGAATTCag
4741 tcgatccaac atggcgactt gtcccatccc cggcatgttt aaatatacta attattcttg
4801 aactaatttt aatcaaccga tttatctctc ttccgcagAT CCACACCATC ATCACCACCA
4861 TGGggtgagc aagggcgagg agctgttcac cggggtggtg cccatcctgg tcgagctgga
4921 cggcgacgta aacggccaca agttcagcgt gtccggcgag ggcgagggcg atgccaccta
4981 cggcaagctg accctgaagt tcatctgcac caccggcaag ctgcccgtgc cctggcccac
5041 cctcgtgacc accctgacct acggcgtgca gtgcttcagc cgctaccccg accacatgaa
5101 gcagcacgac ttcttcaagt ccgccatgcc cgaaggctac gtccaggagc gcaccatctt
5161 cttcaaggac gacggcaact acaagacccg cgccgaggtg aagttcgagg gcgacaccct
5221 ggtgaaccgc atcgagctga agggcatcga cttcaaggag gacggcaaca tcctggggca
5281 caagctggag tacaactaca acagccacaa cgtctatatc atggccgaca agcagaagaa
5341 cggcatcaag gtgaacttca agatccgcca caacatcgag gacggcagcg tgcagctcgc
5401 cgaccactac cagcagaaca cccccatcgg cgacggcccc gtgctgctgc ccgacaacca
5461 ctacctgagc acccagtccg ccctgagcaa agaccccaac gagaagcgcg atcacatggt
5521 cctgctggag ttcgtgaccg ccgccgggat cactctcggc atggacgagc tgtacaaGTC
5581 GACTCGAGgt aagttattga acaatggcat caaatgcctt catcatcact accctttagc
5641 ccttaagacc ccacaatgac cttacccact cagagaaaaa agtaaatatg aaagcccatt
5701 tgaacttctC TGCAGccaag ctttgcgtac tcgcaaatta ttaaaaataa aactttaaaa
5761 ataatttcgt ctaattaata ttatgagtta attcaaaccc cacggacatg ctaagggtta
5821 atcaacaatc atatcgctgt ctcactcaga ctcaatacga cactcagaat actattcctt
5881 tcactcgcac ttattgcaag catacgttaa gtggatgtct cttgccgacg ggaccacctt
5941 atgttatttc atcatg
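The feature coordinates in this record can be checked mechanically. The sketch below is an illustration only: it copies the start and end coordinates of each "source" feature from the table, recomputes every segment size as end - start + 1 (rather than trusting the segment_size qualifiers), and tests that the segments cover positions 1..5956 with no gaps or overlaps. The short labels and the helper names (`segment_sizes`, `tiles_exactly`, `origin_line_end`) are hypothetical shorthand introduced here.

```python
# (start, end, label) for each "source" feature, coordinates copied from the
# FEATURES table; the labels are shorthand, not official names.
SEGMENTS = [
    (1, 587, "Car1_P5'"),
    (588, 601, "J 001069"),
    (602, 3892, "w+3.3"),
    (3893, 3906, "J 001078"),
    (3907, 4728, "w+0.8"),
    (4729, 4732, "J 001068"),
    (4733, 4738, "EcoRI site"),
    (4739, 4838, "splice acceptor"),
    (4839, 4843, "5' linker"),
    (4844, 4861, "6HIS tag"),
    (4862, 4864, "GGG linker"),
    (4865, 5578, "EGFP"),
    (5579, 5588, "3' linker"),
    (5589, 5709, "splice donor"),
    (5710, 5723, "PstI linker"),
    (5724, 5956, "Car1_P3'"),
]

def segment_sizes(segments):
    """Size of each segment computed from its inclusive coordinates."""
    return {label: end - start + 1 for start, end, label in segments}

def tiles_exactly(segments, total_length):
    """True if the segments cover 1..total_length with no gap or overlap."""
    expected_start = 1
    for start, end, _ in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start - 1 == total_length

def origin_line_end(line):
    """Coordinate of the last base on an ORIGIN line like '5941 atgttatttc atcatg'."""
    fields = line.split()
    first = int(fields[0])
    n_bases = sum(len(block) for block in fields[1:])
    return first + n_bases - 1

sizes = segment_sizes(SEGMENTS)
print(sizes["EcoRI site"], sizes["splice acceptor"], sizes["EGFP"])
print(tiles_exactly(SEGMENTS, 5956))
print(origin_line_end("5941 atgttatttc atcatg"))
```

The last call uses the final ORIGIN line of the record to confirm that the sequence ends at the same coordinate as the last feature.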
GAATTCagtcgatccaacatggcgacttgtcccatccccggcatgtttaaatatactaattattcttgaactaattttaatcaaccgatttatctctcttccgcag/
AT CCA CAC CAT CAT CAC CAC CAT GGg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg gtc gag
ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc gag ggc gag ggc gat gcc
acc tac ggc aag ctg acc ctg aag ttc atc tgc acc acc ggc aag ctg ccc gtg ccc tgg
ccc acc ctc gtg acc acc ctg acc tac ggc gtg cag tgc ttc agc cgc tac ccc gac cac
atg aag cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag cgc acc
atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag gtg aag ttc gag ggc gac
acc ctg gtg aac cgc atc gag ctg aag ggc atc gac ttc aag gag gac ggc aac atc ctg
ggg cac aag ctg gag tac aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag
aag aac ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc gtg cag
ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc ccc gtg ctg ctg ccc gac
aac cac tac ctg agc acc cag tcc gcc ctg agc aaa gac ccc aac gag aag cgc gat cac
atg gtc ctg ctg gag ttc gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac
aaG TCG ACT CGA G/
gtaagttattgaacaatggcatcaaatgccttcatcatcactaccctttagcccttaagaccccacaatgaccttacccactcagagaaaaaagtaaatatgaaagcccatttgaacttctctgcag
GAATTCagtcgatccaacatggcgacttgtcccatccccggcatgtttaaatatactaattattcttgaactaattttaatcaaccgatttatctctcttccgcag/
A TCC CAC CAT CAT CAC CAC CAT GGg gtg agc aag ggc gag gag ctg ttc acc ggg
gtg gtg ccc atc ctg gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc
ggc gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc tgc acc acc
ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc ctg acc tac ggc gtg cag tgc
ttc agc cgc tac ccc gac cac atg aag cag cac gac ttc ttc aag tcc gcc atg ccc gaa
ggc tac gtc cag gag cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc
gag gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc atc gac ttc
aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac aac tac aac agc cac aac gtc
tat atc atg gcc gac aag cag aag aac ggc atc aag gtg aac ttc aag atc cgc cac aac
atc gag gac ggc agc gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac
ggc ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc ctg agc aaa gac
ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc gtg acc gcc gcc ggg atc act
ctc ggc atg gac gag ctg tac aaG TCG ACC TCG AG/
gtaagttattgaacaatggcatcaaatgccttcatcatcactaccctttagcccttaagaccccacaatgaccttacccactcagagaaaaaagtaaatatgaaagcccatttgaacttctctgcag
GAATTCagtcgatccaacatggcgacttgtcccatccccggcatgtttaaatatactaattattcttgaactaattttaatcaaccgatttatctctcttccgcag/
ATC CAC CAT CAT CAC CAC CAT GGg gtg agc aag ggc gag gag ctg ttc acc ggg
gtg gtg ccc atc ctg gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc
ggc gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc tgc acc acc
ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc ctg acc tac ggc gtg cag tgc
ttc agc cgc tac ccc gac cac atg aag cag cac gac ttc ttc aag tcc gcc atg ccc gaa
ggc tac gtc cag gag cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc
gag gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc atc gac ttc
aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac aac tac aac agc cac aac gtc
tat atc atg gcc gac aag cag aag aac ggc atc aag gtg aac ttc aag atc cgc cac aac
atc gag gac ggc agc gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac
ggc ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc ctg agc aaa gac
ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc gtg acc gcc gcc ggg atc act
ctc ggc atg gac gag ctg tac aaG TCG ACA CTC GAG/
gtaagttattgaacaatggcatcaaatgccttcatcatcactaccctttagcccttaagaccccacaatgaccttacccactcagagaaaaaagtaaatatgaaagcccatttgaacttctctgcag
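The three cassette variants listed above differ only in the linkers on either side of the tag (presumably corresponding to GA, GB and GC, although the blocks themselves are not labeled). The following sketch, offered as an illustration rather than as the authors' own analysis, copies the linker sequences from the three blocks and computes the length of each artificial exon and the number of bases that precede the first complete codon. The EGFP length of 714 bp is taken from the feature table; the names `exon_length`, `leading_bases` and `translate_his` are hypothetical helpers.

```python
HIS_TAG = "CACCATCATCACCACCAT"   # six His codons (18 bp), copied from the blocks
GGG = "GGG"                      # codon that replaces the EGFP ATG
EGFP_LENGTH = 714                # from the feature table (no start or stop codon)

# (5' linker, 3' linker) for the three variants, in the order listed above.
LINKERS = [
    ("ATCCA", "TCGACTCGAG"),
    ("ATCC", "TCGACCTCGAG"),
    ("ATC", "TCGACACTCGAG"),
]

# Intron boundaries flanking the exon, copied from the splice site sequences.
ACCEPTOR_END = "ttccgcag"
DONOR_START = "gtaagtta"

def exon_length(five_prime, three_prime):
    """Length of the artificial exon between the splice acceptor and donor."""
    return len(five_prime) + len(HIS_TAG) + len(GGG) + EGFP_LENGTH + len(three_prime)

def leading_bases(five_prime):
    """Bases that precede the first complete codon (5' linker length mod 3)."""
    return len(five_prime) % 3

def translate_his(tag):
    """Translate the tag with a two-entry codon table (both codons encode His)."""
    his_codons = {"CAC": "H", "CAT": "H"}
    return "".join(his_codons[tag[i:i + 3]] for i in range(0, len(tag), 3))

for five, three in LINKERS:
    print(len(five), len(three), exon_length(five, three), leading_bases(five))

print(translate_his(HIS_TAG))
print(ACCEPTOR_END.endswith("ag"), DONOR_START.startswith("gt"))
```

Under these assumptions each exon is 750 bp, a multiple of three, which is consistent with an exon that does not shift the reading frame of the host transcript. The 5' linker lengths of 5, 4 and 3 bp leave 2, 1 and 0 bases before the first complete codon, matching the triplet spacing shown in the three blocks.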