(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.soe.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • YDR031W YDR031W SGDID:S0002438, Chr IV from 503493-503846
    • gi|55600472|ref|XP_515693.1|_120:223 PREDICTED: similar to coiled-coil-helix-coiled-coil-helix domain containing 5; chromosome 2 open reading frame 9 [Pan troglodytes]
    • gi|83004949|ref|XP_918378.1|_13:113 PREDICTED: similar to coiled-coil-helix-coiled-coil-helix domain containing 5 [Mus musculus]
    • gi|83004725|ref|XP_897110.1|_59:157 PREDICTED: similar to coiled-coil-helix-coiled-coil-helix domain containing 5 [Mus musculus]
    • gi|76674234|ref|XP_881125.1|_2:94 PREDICTED: similar to coiled-coil-helix-coiled-coil-helix domain containing 5 isoform 2 [Bos taurus]
    • gi|50757139|ref|XP_415397.1|_1009:1106 PREDICTED: similar to LIM homeobox protein 6 isoform 1; LIM homeodomain protein 6.1 [Gallus gallus]
    • gi|55632729|ref|XP_520400.1|_21:118 PREDICTED: similar to NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 8, 19kDa; NADH-ubiquinone oxidoreductase 19 kDa subunit; NADH:ubiquinone oxidoreductase PGIV subunit; NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 8 (19kD, PGIV) ... [Pan troglodytes]
    • gi|55663204|emb|CAH73980.1| NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 8, 19kDa [Homo sapiens]
    • gi|7657369|ref|NP_055037.1| NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 8, 19kDa [Homo sapiens]
    • gi|5326825|gb|AAD42056.1| NADH:ubiquinone oxidoreductase PGIV subunit [Homo sapiens]
    • gi|12654385|gb|AAH01016.1| NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 8, 19kDa [Homo sapiens]
    • gi|8039804|sp|P51970|NUPM_HUMAN NADH-ubiquinone oxidoreductase 19 kDa subunit (Complex I-19KD) (CI-19KD) (Complex I-PGIV) (CI-PGIV) [Taxonomy]
    • gi|28461275|ref|NP_787020.1|_23:117 NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 8, 19kDa [Bos taurus]
    • gi|86827664|gb|AAI05366.1| NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 8, 19kDa [Bos taurus]
    • gi|1171870|sp|P42029|NUPM_BOVIN NADH-ubiquinone oxidoreductase 19 kDa subunit (Complex I-19KD) (CI-19KD) (Complex I-PGIV) (CI-PGIV) [Taxonomy]
    • gi|599681|emb|CAA42218.1| 19 kDa subunit of NADH:ubiquinone oxidoreductase complex (complex I) [Bos taurus]
    • gi|68357132|ref|XP_694033.1|_191:236 PREDICTED: similar to LIM homeobox transcription factor 1 alpha, partial [Danio rerio]
              10        20        30         40        50         60   
              |         |         |          |         |          |   
   1 MSDILDEIVIEDVVANCPQEFLQYHKCIRDNE.ENPGKCKDGRMILSTCIR.EKVPSVKSIMSEC
   2 ------QAALEVTARYCGRELEQYGQCVAAKP.ESWQRDCHYLKMSIAQCT.SSHPIIRQIRQAC
   3 MSDILDEIVIEDVVANCPQEFLQYHKCIRDNE.ENPGKCKDGRMILSTCIR.EKVPSVKSIMSEC
   4 ------QAALEITARYCGRELEQYGQCVVAKP.ESWQRDCHHLKMSIAQCT.SSHPIIRQIRQAC
   5 ------QAALEVTARYCSRELDQYGQCVAAKP.ESWHRDCHHLKMSIARCT.SSHPIIRQIRQAC
   6 ------QAALEVTARYCGRELEQYGQCVAAKP.ESWQRDCHYLKMSIAQCT.SSHPIIRQIRQAC
   7 -SSAPLMSAAYFIGARCRDYNDDFMQCKNENP.GKGEFECLKEGRRVTRCA.RS--VIADINKSC
   8 -SSAPLMSAAYFIGARCRDYNDDFMQCKNENP.GKGEFECLKEGRRLTRCA.RS--VIADINKSC
   9 -SSAPLMSAAYFIGARCRDYNDDFMQCKNENP.GKGEFECLKEGRRLTRCA.RS--VIADINKSC
  10 ---LEMQAALEVTARYCGRELEQYGQCVAAKP.ESWQRDCHYLKMSIAQCT.SSHPIIRQIRQAC
  11 -SSAPLLSASFFIGARCRDYNDDYMQCKNENP.GRGEFECLKEGRRVTRCA.TS--VIKDINTHC
  12 -SSAPLLSASFFIGARCKDYNDDYMQCKTENP.GRGEFECMKEGRRVTRCA.RS--VLEDINKSC
  13 ---HWIQAALEVTAQYCIQELDQYGQCVAAKP.ELWHRDCHPLKMSIARCT.SSHPTIHQNLQAC
  14 -----IQAALEVTAQYCIQELDQYGQCVAAKP.ELWHRDCHPLKMSIARCT.SSHPTIHQNLQAC
  15 ------QAALEITARYCSRELEQYGQCVAAKP.ESWQRDCHHLKMSIAQCT.SAHPIIRQIRQAC
  16 MSDILDEIVIEDVVANCPQEFLQYHKCIRDNE.ENPGKCKDGRMILSTCIR.EKVPSVKSIMSEC
  17 ---ALDQFVMEDVAKHCPNEFMQYHKCISMNH.EDPSQCDFRQRDLAVCIK.QKVPVVQDIMKHC
  18 ------QAALEITARYCSRELEQYGQCVAAKP.ESWQRDCHHLKMSIAQCT.SAHPIIRQIRQAC
  19 ------QAALEITAKYCRNEMEEYGQCVTSKP.GTWQQDCHMLKVKVAQCT.SSHPVIKRIRSQC
  20 -SSAPLLSASFFIGARCRDYNDDYMQCKTENS.GNGEAACLKEGRRVTRCA.RS--VVEDINKSC
  21 --GFLDQFLLEDISRHCPHQFLSFHQCMTLPQ.PDPNQCFQQQVDLTKCIK.TSVPSFAKIQNEC
  22 -SGLLDQILLEDIARHCPQQFLAFHQCMSKPS.PDANLCGLEQYNLAGCIK.KDVPAFQKIQGVC
  23 ----------ELVAKFCHSELEAYGSCVQDNP.QNWPTKCAELKKKVSNCS.STHPSIQRIKRDC
  24 MSMLLDQALVEDVARYCPEQFLNYHKCL--GA.GDVTKCFEEQEKLSTCVK.TSVPTFIKILKDC
  25 --SAPLLSASYFIGDRCKAFNDDFMKCKAEAN.GRGELECLKEGRKVTRCA.AS--VIKDINTHC
  26 --SAPLLSASYFIGDRCKAFNDDFMKCKAEAN.GRGELECLKEGRKVTRCA.AS--VIKDINTHC
  27 --SAPLTSAAYFIGDRCKAFNDDYMKCKEEAN.GRGEIECLREGRKVTRCA.AS--VIKDINTHC
  28 --SAPLLSASYFIGAKCKPYNDDFMLCREESQ.GSGAIDCLKEGRRVTRCA.VS--VIEDINKSC
  29 --SAPLLSASYFIGSKCKPYNDDFMLCKDENN.G-GTLECLKEGRRVTRCA.IS--VLSDINKYC
  30 --SAPLLSASYYIGDKCKPFNEDFLLCKEE-H.NGGTLECMKEGRRVTRCA.IS--VLKDINKYC
  31 --SAPLKSASFFIGEHCKDVNEDFMLCK--NE.SRDPAHCLKEGRRVTRCA.RD--LIKKLSDSC
  32 -SSAVLKAAAHHYGSQCDRPNKEFMLCR--WE.EKDPRKCLREGRQVNQCA.LD--FFRKIKVHC
  33 --------------------------------.--WQNKCKDLGEAASACS.STHPAVQKIQHDC
  34 -SSAVLKAAAHHYGAQCDKTNKEFMLCR--WE.EKDPRRCLKEGKLVNGCA.LN--FFRQIKSHC
  35 --------------------------CR--EE.EKDPRKCLEEGRQVTACS.FK--FFNQIRTHC
  36 -SSAVLKAAAHHYGAQCDKPNKEFMLCR--WE.EKDPRRCLEEGKLVNKCA.LD--FFRQIKRHC
  37 -SSAVLKAAAHHYGAQCDKPNKEFMLCR--WE.EKDPRRCLEEGKLVNKCA.LD--FFRQIKRHC
  38 -SSAVLKAAAHHYGAQCDKTNKEFMLCR--WE.EKDPRRCLKEGKLVNGCA.LN--FFRQIKSHC
  39 -SSAVLKAAAHHYGAQCDKTNKEFMLCR--WE.EKDPRRCLKEGKLVNGCA.LN--FFRQIKSHC
  40 ---SVLKAAAHHYGAQCDKPNKEFMLCR--WE.EKDPRRCLEEGKLVNQCA.LE--FFRQIKRHC
  41 -----------VVAANCATQMAKYQECVLKNQaGDWNQICRPEGRALAACAdAAVPHLAELKASC
  42 --------------------------------.------------------.--LPVIRKIRSDC
  43 ---PWLKAVAPYMAKHCEKEANEFM--LRRKE.SEDPRAVLKEGAALTACG.VN--FLQSLKRSC
  44 --------------AVCGEVFEAYEKCRMEKG.SDPELC-LRESTAVVGCS.QK--VMREIVKNC
  45 ---------AKHIGMRCMPENVAFLKCKKNDP.--NPEKCLDKGRDVTRCV.LG--LLKDLHQKC


           70          80             90       100       110       120 
           |           |              |         |         |         | 
   1 SEPMKKYDQCIRD.N.MGTR.....TINENCLGFLQDLRKCAELQVKNKNIKPSINGVNLELIKD
   2 AQPFEAFEECLRQ.N.EAAV.....GNCAEHMRRFLQCAEQVQPPRSPATV--------------
   3 SEPMKKYDQCIRD.N.MGTR.....TINENCLGFLQDLRKCAELQVKNKNIKPSINGVNLELIKD
   4 AEPFEAFEECLRQ.N.EAAV.....GNCAEHVRRFLQCAEQVQPPRSPSAMEHEENP--------
   5 AEPFEAFEKCLRL.N.EAAV.....GNCAEHMRRFLQCAEQVQPPSSP-----------------
   6 AQPFEAFEECLRQ.N.EAAV.....GNCAEHMRRFLQCAEQVQPPRSPATVE-------------
   7 LEEFRKHWTCLED.N.NQQL.....WQCRPAEWKLNKCVFENLGLKKEIPDQPP-----------
   8 LEEFRKHWTCLED.N.NQQL.....WQCRPAEWKLNKCVFENLGLKKEIPDQPP-----------
   9 LEEFRKHWTCLED.N.NQQL.....WQCRPAEWKLNKCVFENLGLKKEIPDQPP-----------
  10 AQPFEAFEECLRQ.N.EAAV.....GNCAEHMRRFLQCAEQVQPPRSPATV--------------
  11 LAEFRKHWECLDD.R.NHQL.....WQCRPAEWKLNKCVFDNMKLEKKIPDQPT-----------
  12 LEQFRNHWQCLEN.N.NQQL.....WQCRPDEWKLNKCVFEKLNLEKVIPDQPK-----------
  13 AEPFEAFEECLHL.N.EAAV.....GNCVKHVGHFLQCADQVQPPSSP-----------------
  14 AEPFEAFEECLHL.N.EAAV.....GNCVKHVGHFLQCADQVQPPSSP-----------------
  15 SEPFKAFEECLRQ.N.EAAM.....GNCAEHVRRFLQCAEQAH----------------------
  16 SEPMKKYDQCIRD.N.MGTR.....TINENCLGFLQDLRKCAELQVKNKNIKPSINGVN------
  17 SAQMQRYEQCIRD.H.MESR.....TINENCLGLMAEMRSCAERQTAGKARPINELGGRG-----
  18 SEPFKAFEECLRQ.N.EAAM.....GNCAEHVRRFLQCAEQVQPTHRPS----------------
  19 AEPFGAFEQCLKE.N.QTSV.....ENCTKHVTDFLRCAE-------------------------
  20 LEEFRKHWQCLDN.N.NHQL.....WQCRPAEWKLNKCVYENLGLEKTIPDQPT-----------
  21 FGKMQAYEACLKM.N.KSNT.....KSCSHELQNLRNCAFGT-----------------------
  22 SGKLQAYEACLKM.N.GSDQ.....KKCSQDLQSLRDCAFGS-----------------------
  23 HKQFQVYDYCIRN.N.PSDV.....EVCVPALRDFFTCGHNAASDQQL-----------------
  24 DSHLKAYENCLRA.N.QNSR.....SECFDKLQAMRKCSANAIDIAK------------------
  25 LKQFNAHWECLEN.N.NQNL.....WECRKPEMELNSCVFEKLGLKKTIPGAP------------
  26 LKQFNAHWECLEN.N.NQNL.....WECRKPEMELNSCVFEKLGLKKTIPGAP------------
  27 LKQFNTHWECLEN.N.NHRL.....WECRKQEMDLNKCVFDKLGLKKTIPGAP------------
  28 LDEFRLHWQCLEQ.N.NHQL.....SGCRKAEALLNKCVFTKLNLEKKIP---------------
  29 FDEFKLHYECLEQ.E.NHRL.....GHCRNSESVLNKCIFQNMKLEKKIPDVE------------
  30 FDEFKLHYECLEQ.N.NQYF.....SRCRSSEGVLSKCVFDNMKLVKKIP---------------
  31 GKEWEAHYQCLEQ.H.NQEF.....YRCRKPEKTLNQCVFEKLKLAKNIPGSP------------
  32 AEPFTEYWTCIDYsN.LQEL.....RRCRKQQAVFDNCVLEKLGWV-------------------
  33 ASEYQQYDECLKS.N.PQDV.....TKCVNQLHEFMNCAE-------------------------
  34 AEPFTEYWTCLDYsN.MQLF.....RHCRQQQAKFDQCVLDKLGWVR------------------
  35 NESFTEHWTCLDY.N.KQEF.....RRCRQSQKKFDTCVFENLGWVRP-----------------
  36 AEPFTEYWTCIDY.T.GQQLf....RHCRKQQAKFDECVLDKLGWV-------------------
  37 AEPFTEYWTCIDY.T.GQQLf....RHCRKQQAKFDECVLDKMGWV-------------------
  38 AEPFTEYWTCLDY.SsMQLF.....RHCRKQQAKFDECVLDKLGWV-------------------
  39 AEPFTEYWTCLDY.SsMQLF.....RHCRKQQAKFDECVLDKLGWV-------------------
  40 AEPFTEYWTCIDY.S.GLQLf....RRCRKQQAQFDECVLDKLGW--------------------
  41 AEQIATYRQCLEK.H.SSQPdevisKNCGGLMKTLWECT--------------------------
  42 GSEFSVFERCLQE.N.QSSA.....EACQSHLSRFLTCVETV-----------------------
  43 LPQTQKLAECVDQ.G.SAKLym...SKCHNDQKELDACVEANMNLTRP-----------------
  44 QKELNESVKCIEE.N.NMRT.....IPCEEENKAFNECF--------------------------
  45 QKEMDDYVGCMYY.Y.TNEF.....DLCRKEQ---------------------------------