back Return to this vector's summary.
This entry was contributed by Kurt R. Klimpel (klimpel@yoda.nidr.nih.gov)

See a further note contributed by Deepa Bilolikar (deepab@sun.science.wayne.edu)

I used your sequence to reconstruct a fusion protein based on the pGEX- KG sequence in your database. The Sequence contains an error which alters the coding sequence of the linker region after the GST domain. There is an extra "A" at position 941. This changes the linker from a Gly linker to a Trp linker and it should be corrected.

Below is the corrected sequence.

  
LOCUS       pGEX-KG     5005 BP   DNA                         PRELIMINARY
DEFINITION  pGEX-KG
BASE COUNT    1233 a   1205 c   1302 g   1265 t
        1 acgttatcga ctgcacggtg caccaatgct tctggcgtca ggcagccatc ggaagctgtg
       61 gtatggctgt gcaggtcgta aatcactgca taattcgtgt cgctcaaggc gcactcccgt
      121 tctggataat gttttttgcg ccgacatcat aacggttctg gcaaatattc tgaaatgagc
      181 tgttgacaat taatcatcgg ctcgtataat gtgtggaatt gtgagcggat aacaatttca
      241 cacaggaaac agtattcatg tcccctatac taggttattg gaaaattaag ggccttgtgc
      301 aacccactcg acttcttttg gaatatcttg aagaaaaata tgaagagcat ttgtatgagc
      361 gcgatgaagg tgataaatgg cgaaacaaaa agtttgaatt gggtttggag tttcccaatc
      421 ttccttatta tattgatggt gatgttaaat taacacagtc tatggccatc atacgttata
      481 tagctgacaa gcacaacatg ttgggtggtt gtccaaaaga gcgtgcagag atttcaatgc
      541 ttgaaggagc ggttttggat attagatacg gtgtttcgag aattgcatat agtaaagact
      601 ttgaaactct caaagttgat tttcttagca agctacctga aatgctgaaa atgttcgaag
      661 atcgtttatg tcataaaaca tatttaaatg gtgatcatgt aacccatcct gacttcatgt
      721 tgtatgacgc tcttgatgtt gttttataca tggacccaat gtgcctggat gcgttcccaa
      781 aattagtttg ttttaaaaaa cgtattgaag ctatcccaca aattgataag tacttgaaat
      841 ccagcaagta tatagcatgg cctttgcagg gctggcaagc cacgtttggt ggtggcgacc
      901 atcctccaaa atcggatctg gttccgcgtg gatccccggg aatttccggt ggtggtggtg
      961 gaattctaga ctccatgggt cgactcgagc tcaagcttat tcatcgtgac tgactgacga
     1021 tctgcctcgc gcgtttcggt gatgacggtg aaaacctctg acacatgcag ctcccggaga
     1081 cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca agcccgtcag ggcgcgtcag
     1141 cgggtgttgg cgggtgtcgg ggcgcagcca tgacccagtc acgtagcgat agcggagtgt
     1201 ataattcttg aagacgaaag ggcctcgtga tacgcctatt tttataggtt aatgtcatga
     1261 taataatggt ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc ggaaccccta
     1321 tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat
     1381 aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc
     1441 ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga
     1501 aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca
     1561 acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt
     1621 ttaaagttct gctatgtggc gcggtattat cccgtgttga cgccgggcaa gagcaactcg
     1681 gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc
     1741 atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata
     1801 acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt
     1861 tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag
     1921 ccataccaaa cgacgagcgt gacaccacga tgcctgcagc aatggcaaca acgttgcgca
     1981 aactattaac tggcgaacta cttactctag cttcccggca acaattaata gactggatgg
     2041 aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg
     2101 ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag
     2161 atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg
     2221 aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag
     2281 accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga
     2341 tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt
     2401 tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc
     2461 tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc
     2521 cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac
     2581 caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac
     2641 cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt
     2701 cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct
     2761 gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat
     2821 acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt
     2881 atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg
     2941 cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt
     3001 gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt
     3061 tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg
     3121 tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg
     3181 agcgcagcga gtcagtgagc gaggaagcgg aagagcgcct gatgcggtat tttctcctta
     3241 cgcatctgtg cggtatttca caccgcataa attccgacac catcgaatgg tgcaaaacct
     3301 ttcgcggtat ggcatgatag cgcccggaag agagtcaatt cagggtggtg aatgtgaaac
     3361 cagtaacgtt atacgatgtc gcagagtatg ccggtgtctc ttatcagacc gtttcccgcg
     3421 tggtgaacca ggccagccac gtttctgcga aaacgcggga aaaagtggaa gcggcgatgg
     3481 cggagctgaa ttacattccc aaccgcgtgg cacaacaact ggcgggcaaa cagtcgttgc
     3541 tgattggcgt tgccacctcc agtctggccc tgcacgcgcc gtcgcaaatt gtcgcggcga
     3601 ttaaatctcg cgccgatcaa ctgggtgcca gcgtggtggt gtcgatggta gaacgaagcg
     3661 gcgtcgaagc ctgtaaagcg gcggtgcaca atcttctcgc gcaacgcgtc agtgggctga
     3721 tcattaacta tccgctggat gaccaggatg ccattgctgt ggaagctgcc tgcactaatg
     3781 ttccggcgtt atttcttgat gtctctgacc agacacccat caacagtatt attttctccc
     3841 atgaagacgg tacgcgactg ggcgtggagc atctggtcgc attgggtcac cagcaaatcg
     3901 cgctgttagc gggcccatta agttctgtct cggcgcgtct gcgtctggct ggctggcata
     3961 aatatctcac tcgcaatcaa attcagccga tagcggaacg ggaaggcgac tggagtgcca
     4021 tgtccggttt tcaacaaacc atgcaaatgc tgaatgaggg catcgttccc actgcgatgc
     4081 tggttgccaa cgatcagatg gcgctgggcg caatgcgcgc cattaccgag tccgggctgc
     4141 gcgttggtgc ggatatctcg gtagtgggat acgacgatac cgaagacagc tcatgttata
     4201 tcccgccgtt aaccaccatc aaacaggatt ttcgcctgct ggggcaaacc agcgtggacc
     4261 gcttgctgca actctctcag ggccaggcgg tgaagggcaa tcagctgttg cccgtctcac
     4321 tggtgaaaag aaaaaccacc ctggcgccca atacgcaaac cgcctctccc cgcgcgttgg
     4381 ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc
     4441 aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt
     4501 ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagctat
     4561 gaccatgatt acggattcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg
     4621 cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga
     4681 agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgctt
     4741 tgcctggttt ccggcaccag aagcggtgcc ggaaagctgg ctggagtgcg atcttcctga
     4801 ggccgatact gtcgtcgtcc cctcaaactg gcagatgcac ggttacgatg cgcccatcta
     4861 caccaacgta acctatccca ttacggtcaa tccgccgttt gttcccacgg agaatccgac
     4921 gggttgttac tcgctcacat ttaatgttga tgaaagctgg ctacaggaag gccagacgcg
     4981 aattattttt gatggcgttg gaatt
//

A further note added by Deepa Bilolikar

I was cloning a gene into pGEX KG and after sequencing, I noticed a discrepancy between the sequence you've published on the web with that of the original paper by Dixon. The difference comes in the multiple cloning site. Immediately 3' to the HindIII site, the original paper and my sequence reads "AAT TCA TCG TGA" which codes for "N S S stop". The web sequence reades "ATT CAT CGT GAC TGA" (one base off, missing the first A) which codes for "I H A D stop". This doesn't make a difference in the fusion protein since the stop codon is still there, but if someone needed the Serines there versus a basic Histidine, it might pose a problem if they thought the reverse.