2016-06-30 2 views
1

awk를 사용하여 파일에서 정보를 추출하려고합니다. headerlist.txt 파일이 정확히 같은 모양AWK - 다른 파일을 사용하여 정보 추출 - 구문 오류

>ENST00000342992.10 cdna:known chromosome:GRCh38:2:178525989:178807421:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
GCAGTCGTGCATTCCCAGCCTCGCCTCGGGTGTAGGGATTGCATAGAAAAGCAAAACTAC 
ACAGTCTTGACTGTGTAGTTTTGTTTTTAGGATTAGAGGCTCACCGATTCATGTCGGAGA 
TGGTCAGAAAAACCAACTCTCCATAGGACGTCGTTTCAGAAGCAACCTTGGGCTTAGTCC 
CACCCTTTTTAGGCACTCTTGAGAAATCAGAGTGCCTAGAAAGATGACAACTCAAGCACC 
GACGTTTACGCAGCCGTTACAAAGCGTTGTGGTACTGGAGGGTAGTACCGCAACCTTTGA 
GGCTCACATTAGTGGTTTTCCAGTTCCTGAGGTGAGCTGGTTTAGGGATGGCCAGGTGAT 
TTCCACTTCCACTCTGCCCGGCGTGCAGATCTCCTTTAGCGATGGCCGCGCTAAACTGAC 
GATCCCCGCCGTGACTAAAGCCAACAGTGGACGATATTCCCTGAAAGCCACCAATGGATC 
TGGACAAGCGACTAGTACTGCTGAGCTTCTCGTGAAAGCTGAGACAGCACCACCCAACTT 
CGTTCAACGACTGCAGAGCATGACCGTGAGACAAGGAAGCCAAGTGAGACTCCAAGTGAG 
AGTGACTGGAATCCCTACACCTGTGGTGAAGTTCTACCGGGATGGAGCCGAAATCCAGAG 
CTCCCTTGATTTCCAAATTTCACAAGAAGGCGACCTCTACAGCTTACTGATTGCAGAAGC 
ATACCCTGAGGACTCAGGGACCTATTCAGTAAATGCCACCAATAGCGTTGGAAGAGCTAC 
TTCGACTGCTGAATTACTGGTTCAAGGTGAAGAAGAAGTACCTGCTAAAAAGACAAAGAC 
AATTGTTTCGACTGCTCAGATCTCAGAATCAAGACAAACCCGAATTGAAAAGAAGATTGA 
AGCCCACTTTGATGCCAGATCAATTGCAACAGTTGAGATGGTCATAGATGGTGCCGCTGG 
GCAACAGCTGCCACATAAAACACCTCCCAGGATTCCTCCGAAGCCAAAGTCAAGATCCCC 
AACACCACCGTCTATTGCTGCCAAAGCACAGCTGGCTCGGCAGCAGTCCCCATCGCCCAT 
AAGACACTCCCCTTCCCCGGTCAGACACGTGCGGGCACCGACCCCATCTCCGGTCAGGTC 
CGTGTCTCCAGCAGCAAGAATCTCCACATCCCCCATCAGGTCTGTTAGGTCTCCATTGCT 
CATGCGTAAGACTCAGGCATCCACCGTGGCCACAGGTCCTGAAGTGCCTCCCCCTTGGAA 
GCAAGAGGGCTACGTGGCCTCCTCATCTGAGGCTGAGATGAGAGAGACAACGCTGACAAC 
CTCTACTCAGATCAGGACAGAAGAGAGATGGGAAGGGAGATACGGTGTCCAGGAGCAAGT 
GACCATCAGTGGTGCTGCGGGTGCTGCCGCCAGTGTGTCGGCCAGTGCTAGCTACGCAGC 
AGAGGCTGTTGCCACTGGTGCTAAAGAGGTGAAACAAGATGCTGACAAAAGTGCAGCTGT 
TGCGACTGTTGTTGCTGCCGTTGATATGGCCAGAGTGAGAGAACCAGTGATCAGCGCTGT 
AGAGCAGACTGCTCAGAGGACAACCACGACTGCTGTGCACATCCAACCTGCTCAAGAACA 
GGTAAGAAAGGAAGCGGAGAAGACTGCTGTAACTAAGGTAGTAGTGGCCGCCGATAAAGC 
CAAGGAACAAGAATTAAAATCAAGAACCAAAGAAGTAATTACCACAAAGCAAGAGCAGAT 
GCACGTAACTCATGAGCAGATAAGAAAAGAAACTGAAAAAACATTTGTACCAAAGGTAGT 
AATTTCCGCAGCTAAAGCCAAAGAACAAGAAACTAGAATTTCTGAAGAAATTACTAAGAA 
ACAGAAACAAGTAACTCAAGAAGCAATAAGACAGGAAACTGAGATAACTGCTGCATCCAT 
GGTGGTAGTTGCCACTGCAAAGTCCACAAAACTAGAAACAGTCCCGGGAGCTCAAGAAGA 
AACTACCACACAACAAGATCAAATGCACCTAAGTTATGAAAAGATAATGAAGGAAACTAG 
GAAAACAGTTGTACCTAAAGTCATAGTTGCCACACCCAAAGTCAAAGAACAAGATTTAGT 
>ENST00000460472.6 cdna:known chromosome:GRCh38:2:178525989:178807423:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
GCAGTCGTGCATTCCCAGCCTCGCCTCGGGTGTAGGGATTGCATAGAAAAGCAAAACTAC 
ACAGTCTTGACTGTGTAGTTTTGTTTTTAGGATTAGAGGCTCACCGATTCATGTCGGAGA 
TGGTCAGAAAAACCAACTCTCCATAGGACGTCGTTTCAGAAGCAACCTTGGGCTTAGTCC 
CACCCTTTTTAGGCACTCTTGAGAAATCAGAGTGCCTAGAAAGATGACAACTCAAGCACC 
GACGTTTACGCAGCCGTTACAAAGCGTTGTGGTACTGGAGGGTAGTACCGCAACCTTTGA 
GGCTCACATTAGTGGTTTTCCAGTTCCTGAGGTGAGCTGGTTTAGGGATGGCCAGGTGAT 
TTCCACTTCCACTCTGCCCGGCGTGCAGATCTCCTTTAGCGATGGCCGCGCTAAACTGAC 
GATCCCCGCCGTGACTAAAGCCAACAGTGGACGATATTCCCTGAAAGCCACCAATGGATC 
TGGACAAGCGACTAGTACTGCTGAGCTTCTCGTGAAAGCTGAGACAGCACCACCCAACTT 
CGTTCAACGACTGCAGAGCATGACCGTGAGACAAGGAAGCCAAGTGAGACTCCAAGTGAG 
AGTGACTGGAATCCCTACACCTGTGGTGAAGTTCTACCGGGATGGAGCCGAAATCCAGAG 
CTCCCTTGATTTCCAAATTTCACAAGAAGGCGACCTCTACAGCTTACTGATTGCAGAAGC 
ATACCCTGAGGACTCAGGGACCTATTCAGTAAATGCCACCAATAGCGTTGGAAGAGCTAC 
TTCGACTGCTGAATTACTGGTTCAAGGTGAAGAAGAAGTACCTGCTAAAAAGACAAAGAC 
AATTGTTTCGACTGCTCAGATCTCAGAATCAAGACAAACCCGAATTGAAAAGAAGATTGA 
AGCCCACTTTGATGCCAGATCAATTGCAACAGTTGAGATGGTCATAGATGGTGCCGCTGG 
GCAACAGCTGCCACATAAAACACCTCCCAGGATTCCTCCGAAGCCAAAGTCAAGATCCCC 
AACACCACCGTCTATTGCTGCCAAAGCACAGCTGGCTCGGCAGCAGTCCCCATCGCCCAT 
AAGACACTCCCCTTCCCCGGTCAGACACGTGCGGGCACCGACCCCATCTCCGGTCAGGTC 
CGTGTCTCCAGCAGCAAGAATCTCCACATCCCCCATCAGGTCTGTTAGGTCTCCATTGCT 
CATGCGTAAGACTCAGGCATCCACCGTGGCCACAGGTCCTGAAGTGCCTCCCCCTTGGAA 
GCAAGAGGGCTACGTGGCCTCCTCATCTGAGGCTGAGATGAGAGAGACAACGCTGACAAC 
CTCTACTCAGATCAGGACAGAAGAGAGATGGGAAGGGAGATACGGTGTCCAGGAGCAAGT 
GACCATCAGTGGTGCTGCGGGTGCTGCCGCCAGTGTGTCGGCCAGTGCTAGCTACGCAGC 
AGAGGCTGTTGCCACTGGTGCTAAAGAGGTGAAACAAGATGCTGACAAAAGTGCAGCTGT 
TGCGACTGTTGTTGCTGCCGTTGATATGGCCAGAGTGAGAGAACCAGTGATCAGCGCTGT 
AGAGCAGACTGCTCAGAGGACAACCACGACTGCTGTGCACATCCAACCTGCTCAAGAACA 
GGTAAGAAAGGAAGCGGAGAAGACTGCTGTAACTAAGGTAGTAGTGGCCGCCGATAAAGC 
CAAGGAACAAGAATTAAAATCAAGAACCAAAGAAGTAATTACCACAAAGCAAGAGCAGAT 
GCACGTAACTCATGAGCAGATAAGAAAAGAAACTGAAAAAACATTTGTACCAAAGGTAGT 
AATTTCCGCAGCTAAAGCCAAAGAACAAGAAACTAGAATTTCTGAAGAAATTACTAAGAA 
ACAGAAACAAGTAACTCAAGAAGCAATAAGACAGGAAACTGAGATAACTGCTGCATCCAT 
GGTGGTAGTTGCCACTGCAAAGTCCACAAAACTAGAAACAGTCCCGGGAGCTCAAGAAGA 
AACTACCACACAACAAGATCAAATGCACCTAAGTTATGAAAAGATAATGAAGGAAACTAG 
GAAAACAGTTGTACCTAAAGTCATAGTTGCCACACCCAAAGTCAAAGAACAAGATTTAGT 
>ENST00000589042.5 cdna:known chromosome:GRCh38:2:178525989:178807423:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
GCAGTCGTGCATTCCCAGCCTCGCCTCGGGTGTAGGGATTGCATAGAAAAGCAAAACTAC 
ACAGTCTTGACTGTGTAGTTTTGTTTTTAGGATTAGAGGCTCACCGATTCATGTCGGAGA 
TGGTCAGAAAAACCAACTCTCCATAGGACGTCGTTTCAGAAGCAACCTTGGGCTTAGTCC 
CACCCTTTTTAGGCACTCTTGAGAAATCAGAGTGCCTAGAAAGATGACAACTCAAGCACC 
GACGTTTACGCAGCCGTTACAAAGCGTTGTGGTACTGGAGGGTAGTACCGCAACCTTTGA 
GGCTCACATTAGTGGTTTTCCAGTTCCTGAGGTGAGCTGGTTTAGGGATGGCCAGGTGAT 
TTCCACTTCCACTCTGCCCGGCGTGCAGATCTCCTTTAGCGATGGCCGCGCTAAACTGAC 
GATCCCCGCCGTGACTAAAGCCAACAGTGGACGATATTCCCTGAAAGCCACCAATGGATC 
TGGACAAGCGACTAGTACTGCTGAGCTTCTCGTGAAAGCTGAGACAGCACCACCCAACTT 
CGTTCAACGACTGCAGAGCATGACCGTGAGACAAGGAAGCCAAGTGAGACTCCAAGTGAG 
AGTGACTGGAATCCCTACACCTGTGGTGAAGTTCTACCGGGATGGAGCCGAAATCCAGAG 
CTCCCTTGATTTCCAAATTTCACAAGAAGGCGACCTCTACAGCTTACTGATTGCAGAAGC 
ATACCCTGAGGACTCAGGGACCTATTCAGTAAATGCCACCAATAGCGTTGGAAGAGCTAC 
TTCGACTGCTGAATTACTGGTTCAAGGTGAAGAAGAAGTACCTGCTAAAAAGACAAAGAC 
AATTGTTTCGACTGCTCAGATCTCAGAATCAAGACAAACCCGAATTGAAAAGAAGATTGA 
AGCCCACTTTGATGCCAGATCAATTGCAACAGTTGAGATGGTCATAGATGGTGCCGCTGG 
GCAACAGCTGCCACATAAAACACCTCCCAGGATTCCTCCGAAGCCAAAGTCAAGATCCCC 
AACACCACCGTCTATTGCTGCCAAAGCACAGCTGGCTCGGCAGCAGTCCCCATCGCCCAT 
AAGACACTCCCCTTCCCCGGTCAGACACGTGCGGGCACCGACCCCATCTCCGGTCAGGTC 
CGTGTCTCCAGCAGCAAGAATCTCCACATCCCCCATCAGGTCTGTTAGGTCTCCATTGCT 
CATGCGTAAGACTCAGGCATCCACCGTGGCCACAGGTCCTGAAGTGCCTCCCCCTTGGAA 
GCAAGAGGGCTACGTGGCCTCCTCATCTGAGGCTGAGATGAGAGAGACAACGCTGACAAC 
CTCTACTCAGATCAGGACAGAAGAGAGATGGGAAGGGAGATACGGTGTCCAGGAGCAAGT 
GACCATCAGTGGTGCTGCGGGTGCTGCCGCCAGTGTGTCGGCCAGTGCTAGCTACGCAGC 
AGAGGCTGTTGCCACTGGTGCTAAAGAGGTGAAACAAGATGCTGACAAAAGTGCAGCTGT 
TGCGACTGTTGTTGCTGCCGTTGATATGGCCAGAGTGAGAGAACCAGTGATCAGCGCTGT 
AGAGCAGACTGCTCAGAGGACAACCACGACTGCTGTGCACATCCAACCTGCTCAAGAACA 
GGTAAGAAAGGAAGCGGAGAAGACTGCTGTAACTAAGGTAGTAGTGGCCGCCGATAAAGC 
CAAGGAACAAGAATTAAAATCAAGAACCAAAGAAGTAATTACCACAAAGCAAGAGCAGAT 
GCACGTAACTCATGAGCAGATAAGAAAAGAAACTGAAAAAACATTTGTACCAAAGGTAGT 
AATTTCCGCAGCTAAAGCCAAAGAACAAGAAACTAGAATTTCTGAAGAAATTACTAAGAA 
ACAGAAACAAGTAACTCAAGAAGCAATAAGACAGGAAACTGAGATAACTGCTGCATCCAT 
GGTGGTAGTTGCCACTGCAAAGTCCACAAAACTAGAAACAGTCCCGGGAGCTCAAGAAGA 
AACTACCACACAACAAGATCAAATGCACCTAAGTTATGAAAAGATAATGAAGGAAACTAG 
GAAAACAGTTGTACCTAAAGTCATAGTTGCCACACCCAAAGTCAAAGAACAAGATTTAGT 
>ENST00000591111.5 cdna:known chromosome:GRCh38:2:178525989:178807423:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
GCAGTCGTGCATTCCCAGCCTCGCCTCGGGTGTAGGGATTGCATAGAAAAGCAAAACTAC 
ACAGTCTTGACTGTGTAGTTTTGTTTTTAGGATTAGAGGCTCACCGATTCATGTCGGAGA 
TGGTCAGAAAAACCAACTCTCCATAGGACGTCGTTTCAGAAGCAACCTTGGGCTTAGTCC 
CACCCTTTTTAGGCACTCTTGAGAAATCAGAGTGCCTAGAAAGATGACAACTCAAGCACC 
GACGTTTACGCAGCCGTTACAAAGCGTTGTGGTACTGGAGGGTAGTACCGCAACCTTTGA 
GGCTCACATTAGTGGTTTTCCAGTTCCTGAGGTGAGCTGGTTTAGGGATGGCCAGGTGAT 
TTCCACTTCCACTCTGCCCGGCGTGCAGATCTCCTTTAGCGATGGCCGCGCTAAACTGAC 
GATCCCCGCCGTGACTAAAGCCAACAGTGGACGATATTCCCTGAAAGCCACCAATGGATC 
TGGACAAGCGACTAGTACTGCTGAGCTTCTCGTGAAAGCTGAGACAGCACCACCCAACTT 
CGTTCAACGACTGCAGAGCATGACCGTGAGACAAGGAAGCCAAGTGAGACTCCAAGTGAG 
AGTGACTGGAATCCCTACACCTGTGGTGAAGTTCTACCGGGATGGAGCCGAAATCCAGAG 
CTCCCTTGATTTCCAAATTTCACAAGAAGGCGACCTCTACAGCTTACTGATTGCAGAAGC 
ATACCCTGAGGACTCAGGGACCTATTCAGTAAATGCCACCAATAGCGTTGGAAGAGCTAC 
TTCGACTGCTGAATTACTGGTTCAAGGTGAAGAAGAAGTACCTGCTAAAAAGACAAAGAC 
AATTGTTTCGACTGCTCAGATCTCAGAATCAAGACAAACCCGAATTGAAAAGAAGATTGA 
AGCCCACTTTGATGCCAGATCAATTGCAACAGTTGAGATGGTCATAGATGGTGCCGCTGG 
GCAACAGCTGCCACATAAAACACCTCCCAGGATTCCTCCGAAGCCAAAGTCAAGATCCCC 
AACACCACCGTCTATTGCTGCCAAAGCACAGCTGGCTCGGCAGCAGTCCCCATCGCCCAT 
AAGACACTCCCCTTCCCCGGTCAGACACGTGCGGGCACCGACCCCATCTCCGGTCAGGTC 
CGTGTCTCCAGCAGCAAGAATCTCCACATCCCCCATCAGGTCTGTTAGGTCTCCATTGCT 
CATGCGTAAGACTCAGGCATCCACCGTGGCCACAGGTCCTGAAGTGCCTCCCCCTTGGAA 
GCAAGAGGGCTACGTGGCCTCCTCATCTGAGGCTGAGATGAGAGAGACAACGCTGACAAC 
CTCTACTCAGATCAGGACAGAAGAGAGATGGGAAGGGAGATACGGTGTCCAGGAGCAAGT 
GACCATCAGTGGTGCTGCGGGTGCTGCCGCCAGTGTGTCGGCCAGTGCTAGCTACGCAGC 
AGAGGCTGTTGCCACTGGTGCTAAAGAGGTGAAACAAGATGCTGACAAAAGTGCAGCTGT 
TGCGACTGTTGTTGCTGCCGTTGATATGGCCAGAGTGAGAGAACCAGTGATCAGCGCTGT 
AGAGCAGACTGCTCAGAGGACAACCACGACTGCTGTGCACATCCAACCTGCTCAAGAACA 
GGTAAGAAAGGAAGCGGAGAAGACTGCTGTAACTAAGGTAGTAGTGGCCGCCGATAAAGC 
CAAGGAACAAGAATTAAAATCAAGAACCAAAGAAGTAATTACCACAAAGCAAGAGCAGAT 
GCACGTAACTCATGAGCAGATAAGAAAAGAAACTGAAAAAACATTTGTACCAAAGGTAGT 
AATTTCCGCAGCTAAAGCCAAAGAACAAGAAACTAGAATTTCTGAAGAAATTACTAAGAA 
ACAGAAACAAGTAACTCAAGAAGCAATAAGACAGGAAACTGAGATAACTGCTGCATCCAT 
GGTGGTAGTTGCCACTGCAAAGTCCACAAAACTAGAAACAGTCCCGGGAGCTCAAGAAGA 
AACTACCACACAACAAGATCAAATGCACCTAAGTTATGAAAAGATAATGAAGGAAACTAG 
GAAAACAGTTGTACCTAAAGTCATAGTTGCCACACCCAAAGTCAAAGAACAAGATTTAGT 
>ENST00000425332.2 cdna:known chromosome:GRCh38:2:178663627:178667307:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
GCAGTCGTGCATTCCCAGCCTCGCCTCGGGTGTAGGGATTGCATAGAAAAGCAAAACTAC 
ACAGTCTTGACTGTGTAGTTTTGTTTTTAGGATTAGAGGCTCACCGATTCATGTCGGAGA 
TGGTCAGAAAAACCAACTCTCCATAGGACGTCGTTTCAGAAGCAACCTTGGGCTTAGTCC 
CACCCTTTTTAGGCACTCTTGAGAAATCAGAGTGCCTAGAAAGATGACAACTCAAGCACC 
GACGTTTACGCAGCCGTTACAAAGCGTTGTGGTACTGGAGGGTAGTACCGCAACCTTTGA 
GGCTCACATTAGTGGTTTTCCAGTTCCTGAGGTGAGCTGGTTTAGGGATGGCCAGGTGAT 
TTCCACTTCCACTCTGCCCGGCGTGCAGATCTCCTTTAGCGATGGCCGCGCTAAACTGAC 
GATCCCCGCCGTGACTAAAGCCAACAGTGGACGATATTCCCTGAAAGCCACCAATGGATC 
TGGACAAGCGACTAGTACTGCTGAGCTTCTCGTGAAAGCTGAGACAGCACCACCCAACTT 
CGTTCAACGACTGCAGAGCATGACCGTGAGACAAGGAAGCCAAGTGAGACTCCAAGTGAG 
AGTGACTGGAATCCCTACACCTGTGGTGAAGTTCTACCGGGATGGAGCCGAAATCCAGAG 
CTCCCTTGATTTCCAAATTTCACAAGAAGGCGACCTCTACAGCTTACTGATTGCAGAAGC 
ATACCCTGAGGACTCAGGGACCTATTCAGTAAATGCCACCAATAGCGTTGGAAGAGCTAC 
TTCGACTGCTGAATTACTGGTTCAAGGTGAAGAAGAAGTACCTGCTAAAAAGACAAAGAC 
AATTGTTTCGACTGCTCAGATCTCAGAATCAAGACAAACCCGAATTGAAAAGAAGATTGA 
AGCCCACTTTGATGCCAGATCAATTGCAACAGTTGAGATGGTCATAGATGGTGCCGCTGG 
GCAACAGCTGCCACATAAAACACCTCCCAGGATTCCTCCGAAGCCAAAGTCAAGATCCCC 
AACACCACCGTCTATTGCTGCCAAAGCACAGCTGGCTCGGCAGCAGTCCCCATCGCCCAT 
AAGACACTCCCCTTCCCCGGTCAGACACGTGCGGGCACCGACCCCATCTCCGGTCAGGTC 
CGTGTCTCCAGCAGCAAGAATCTCCACATCCCCCATCAGGTCTGTTAGGTCTCCATTGCT 
CATGCGTAAGACTCAGGCATCCACCGTGGCCACAGGTCCTGAAGTGCCTCCCCCTTGGAA 
GCAAGAGGGCTACGTGGCCTCCTCATCTGAGGCTGAGATGAGAGAGACAACGCTGACAAC 
CTCTACTCAGATCAGGACAGAAGAGAGATGGGAAGGGAGATACGGTGTCCAGGAGCAAGT 
GACCATCAGTGGTGCTGCGGGTGCTGCCGCCAGTGTGTCGGCCAGTGCTAGCTACGCAGC 
AGAGGCTGTTGCCACTGGTGCTAAAGAGGTGAAACAAGATGCTGACAAAAGTGCAGCTGT 
TGCGACTGTTGTTGCTGCCGTTGATATGGCCAGAGTGAGAGAACCAGTGATCAGCGCTGT 
AGAGCAGACTGCTCAGAGGACAACCACGACTGCTGTGCACATCCAACCTGCTCAAGAACA 
GGTAAGAAAGGAAGCGGAGAAGACTGCTGTAACTAAGGTAGTAGTGGCCGCCGATAAAGC 
CAAGGAACAAGAATTAAAATCAAGAACCAAAGAAGTAATTACCACAAAGCAAGAGCAGAT 
GCACGTAACTCATGAGCAGATAAGAAAAGAAACTGAAAAAACATTTGTACCAAAGGTAGT 
AATTTCCGCAGCTAAAGCCAAAGAACAAGAAACTAGAATTTCTGAAGAAATTACTAAGAA 
ACAGAAACAAGTAACTCAAGAAGCAATAAGACAGGAAACTGAGATAACTGCTGCATCCAT 
GGTGGTAGTTGCCACTGCAAAGTCCACAAAACTAGAAACAGTCCCGGGAGCTCAAGAAGA 
AACTACCACACAACAAGATCAAATGCACCTAAGTTATGAAAAGATAATGAAGGAAACTAG 
GAAAACAGTTGTACCTAAAGTCATAGTTGCCACACCCAAAGTCAAAGAACAAGATTTAGT 
>ENST00000448510.2 cdna:known chromosome:GRCh38:2:178669625:178672418:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
GCAGTCGTGCATTCCCAGCCTCGCCTCGGGTGTAGGGATTGCATAGAAAAGCAAAACTAC 
ACAGTCTTGACTGTGTAGTTTTGTTTTTAGGATTAGAGGCTCACCGATTCATGTCGGAGA 
TGGTCAGAAAAACCAACTCTCCATAGGACGTCGTTTCAGAAGCAACCTTGGGCTTAGTCC 
CACCCTTTTTAGGCACTCTTGAGAAATCAGAGTGCCTAGAAAGATGACAACTCAAGCACC 
GACGTTTACGCAGCCGTTACAAAGCGTTGTGGTACTGGAGGGTAGTACCGCAACCTTTGA 
GGCTCACATTAGTGGTTTTCCAGTTCCTGAGGTGAGCTGGTTTAGGGATGGCCAGGTGAT 
TTCCACTTCCACTCTGCCCGGCGTGCAGATCTCCTTTAGCGATGGCCGCGCTAAACTGAC 
GATCCCCGCCGTGACTAAAGCCAACAGTGGACGATATTCCCTGAAAGCCACCAATGGATC 
TGGACAAGCGACTAGTACTGCTGAGCTTCTCGTGAAAGCTGAGACAGCACCACCCAACTT 
CGTTCAACGACTGCAGAGCATGACCGTGAGACAAGGAAGCCAAGTGAGACTCCAAGTGAG 
AGTGACTGGAATCCCTACACCTGTGGTGAAGTTCTACCGGGATGGAGCCGAAATCCAGAG 
CTCCCTTGATTTCCAAATTTCACAAGAAGGCGACCTCTACAGCTTACTGATTGCAGAAGC 
ATACCCTGAGGACTCAGGGACCTATTCAGTAAATGCCACCAATAGCGTTGGAAGAGCTAC 
TTCGACTGCTGAATTACTGGTTCAAGGTGAAGAAGAAGTACCTGCTAAAAAGACAAAGAC 
AATTGTTTCGACTGCTCAGATCTCAGAATCAAGACAAACCCGAATTGAAAAGAAGATTGA 
AGCCCACTTTGATGCCAGATCAATTGCAACAGTTGAGATGGTCATAGATGGTGCCGCTGG 
GCAACAGCTGCCACATAAAACACCTCCCAGGATTCCTCCGAAGCCAAAGTCAAGATCCCC 
AACACCACCGTCTATTGCTGCCAAAGCACAGCTGGCTCGGCAGCAGTCCCCATCGCCCAT 
AAGACACTCCCCTTCCCCGGTCAGACACGTGCGGGCACCGACCCCATCTCCGGTCAGGTC 
CGTGTCTCCAGCAGCAAGAATCTCCACATCCCCCATCAGGTCTGTTAGGTCTCCATTGCT 
CATGCGTAAGACTCAGGCATCCACCGTGGCCACAGGTCCTGAAGTGCCTCCCCCTTGGAA 
GCAAGAGGGCTACGTGGCCTCCTCATCTGAGGCTGAGATGAGAGAGACAACGCTGACAAC 
CTCTACTCAGATCAGGACAGAAGAGAGATGGGAAGGGAGATACGGTGTCCAGGAGCAAGT 
GACCATCAGTGGTGCTGCGGGTGCTGCCGCCAGTGTGTCGGCCAGTGCTAGCTACGCAGC 
AGAGGCTGTTGCCACTGGTGCTAAAGAGGTGAAACAAGATGCTGACAAAAGTGCAGCTGT 
TGCGACTGTTGTTGCTGCCGTTGATATGGCCAGAGTGAGAGAACCAGTGATCAGCGCTGT 
AGAGCAGACTGCTCAGAGGACAACCACGACTGCTGTGCACATCCAACCTGCTCAAGAACA 
GGTAAGAAAGGAAGCGGAGAAGACTGCTGTAACTAAGGTAGTAGTGGCCGCCGATAAAGC 
CAAGGAACAAGAATTAAAATCAAGAACCAAAGAAGTAATTACCACAAAGCAAGAGCAGAT 
GCACGTAACTCATGAGCAGATAAGAAAAGAAACTGAAAAAACATTTGTACCAAAGGTAGT 
AATTTCCGCAGCTAAAGCCAAAGAACAAGAAACTAGAATTTCTGAAGAAATTACTAAGAA 
ACAGAAACAAGTAACTCAAGAAGCAATAAGACAGGAAACTGAGATAACTGCTGCATCCAT 
GGTGGTAGTTGCCACTGCAAAGTCCACAAAACTAGAAACAGTCCCGGGAGCTCAAGAAGA 
AACTACCACACAACAAGATCAAATGCACCTAAGTTATGAAAAGATAATGAAGGAAACTAG 
GAAAACAGTTGTACCTAAAGTCATAGTTGCCACACCCAAAGTCAAAGAACAAGATTTAGT 
>ENST00000360870.9 cdna:known chromosome:GRCh38:2:178744405:178807421:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
GCAGTCGTGCATTCCCAGCCTCGCCTCGGGTGTAGGGATTGCATAGAAAAGCAAAACTAC 
ACAGTCTTGACTGTGTAGTTTTGTTTTTAGGATTAGAGGCTCACCGATTCATGTCGGAGA 
TGGTCAGAAAAACCAACTCTCCATAGGACGTCGTTTCAGAAGCAACCTTGGGCTTAGTCC 
CACCCTTTTTAGGCACTCTTGAGAAATCAGAGTGCCTAGAAAGATGACAACTCAAGCACC 
GACGTTTACGCAGCCGTTACAAAGCGTTGTGGTACTGGAGGGTAGTACCGCAACCTTTGA 
GGCTCACATTAGTGGTTTTCCAGTTCCTGAGGTGAGCTGGTTTAGGGATGGCCAGGTGAT 
TTCCACTTCCACTCTGCCCGGCGTGCAGATCTCCTTTAGCGATGGCCGCGCTAAACTGAC 
GATCCCCGCCGTGACTAAAGCCAACAGTGGACGATATTCCCTGAAAGCCACCAATGGATC 
TGGACAAGCGACTAGTACTGCTGAGCTTCTCGTGAAAGCTGAGACAGCACCACCCAACTT 
CGTTCAACGACTGCAGAGCATGACCGTGAGACAAGGAAGCCAAGTGAGACTCCAAGTGAG 
AGTGACTGGAATCCCTACACCTGTGGTGAAGTTCTACCGGGATGGAGCCGAAATCCAGAG 
CTCCCTTGATTTCCAAATTTCACAAGAAGGCGACCTCTACAGCTTACTGATTGCAGAAGC 
ATACCCTGAGGACTCAGGGACCTATTCAGTAAATGCCACCAATAGCGTTGGAAGAGCTAC 
TTCGACTGCTGAATTACTGGTTCAAGGTGAAGAAGAAGTACCTGCTAAAAAGACAAAGAC 
AATTGTTTCGACTGCTCAGATCTCAGAATCAAGACAAACCCGAATTGAAAAGAAGATTGA 
AGCCCACTTTGATGCCAGATCAATTGCAACAGTTGAGATGGTCATAGATGGTGCCGCTGG 
GCAACAGCTGCCACATAAAACACCTCCCAGGATTCCTCCGAAGCCAAAGTCAAGATCCCC 
AACACCACCGTCTATTGCTGCCAAAGCACAGCTGGCTCGGCAGCAGTCCCCATCGCCCAT 
AAGACACTCCCCTTCCCCGGTCAGACACGTGCGGGCACCGACCCCATCTCCGGTCAGGTC 
CGTGTCTCCAGCAGCAAGAATCTCCACATCCCCCATCAGGTCTGTTAGGTCTCCATTGCT 
CATGCGTAAGACTCAGGCATCCACCGTGGCCACAGGTCCTGAAGTGCCTCCCCCTTGGAA 
GCAAGAGGGCTACGTGGCCTCCTCATCTGAGGCTGAGATGAGAGAGACAACGCTGACAAC 
CTCTACTCAGATCAGGACAGAAGAGAGATGGGAAGGGAGATACGGTGTCCAGGAGCAAGT 
GACCATCAGTGGTGCTGCGGGTGCTGCCGCCAGTGTGTCGGCCAGTGCTAGCTACGCAGC 
AGAGGCTGTTGCCACTGGTGCTAAAGAGGTGAAACAAGATGCTGACAAAAGTGCAGCTGT 
TGCGACTGTTGTTGCTGCCGTTGATATGGCCAGAGTGAGAGAACCAGTGATCAGCGCTGT 
AGAGCAGACTGCTCAGAGGACAACCACGACTGCTGTGCACATCCAACCTGCTCAAGAACA 
GGTAAGAAAGGAAGCGGAGAAGACTGCTGTAACTAAGGTAGTAGTGGCCGCCGATAAAGC 
CAAGGAACAAGAATTAAAATCAAGAACCAAAGAAGTAATTACCACAAAGCAAGAGCAGAT 
GCACGTAACTCATGAGCAGATAAGAAAAGAAACTGAAAAAACATTTGTACCAAAGGTAGT 
AATTTCCGCAGCTAAAGCCAAAGAACAAGAAACTAGAATTTCTGAAGAAATTACTAAGAA 
ACAGAAACAAGTAACTCAAGAAGCAATAAGACAGGAAACTGAGATAACTGCTGCATCCAT 
GGTGGTAGTTGCCACTGCAAAGTCCACAAAACTAGAAACAGTCCCGGGAGCTCAAGAAGA 
AACTACCACACAACAAGATCAAATGCACCTAAGTTATGAAAAGATAATGAAGGAAACTAG 
GAAAACAGTTGTACCTAAAGTCATAGTTGCCACACCCAAAGTCAAAGAACAAGATTTAGT 
>ENST00000634225.1 cdna:known chromosome:GRCh38:2:178753361:178767825:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
GCAGTCGTGCATTCCCAGCCTCGCCTCGGGTGTAGGGATTGCATAGAAAAGCAAAACTAC 
ACAGTCTTGACTGTGTAGTTTTGTTTTTAGGATTAGAGGCTCACCGATTCATGTCGGAGA 
TGGTCAGAAAAACCAACTCTCCATAGGACGTCGTTTCAGAAGCAACCTTGGGCTTAGTCC 
CACCCTTTTTAGGCACTCTTGAGAAATCAGAGTGCCTAGAAAGATGACAACTCAAGCACC 
GACGTTTACGCAGCCGTTACAAAGCGTTGTGGTACTGGAGGGTAGTACCGCAACCTTTGA 
GGCTCACATTAGTGGTTTTCCAGTTCCTGAGGTGAGCTGGTTTAGGGATGGCCAGGTGAT 
TTCCACTTCCACTCTGCCCGGCGTGCAGATCTCCTTTAGCGATGGCCGCGCTAAACTGAC 
GATCCCCGCCGTGACTAAAGCCAACAGTGGACGATATTCCCTGAAAGCCACCAATGGATC 
TGGACAAGCGACTAGTACTGCTGAGCTTCTCGTGAAAGCTGAGACAGCACCACCCAACTT 
CGTTCAACGACTGCAGAGCATGACCGTGAGACAAGGAAGCCAAGTGAGACTCCAAGTGAG 
AGTGACTGGAATCCCTACACCTGTGGTGAAGTTCTACCGGGATGGAGCCGAAATCCAGAG 
CTCCCTTGATTTCCAAATTTCACAAGAAGGCGACCTCTACAGCTTACTGATTGCAGAAGC 
ATACCCTGAGGACTCAGGGACCTATTCAGTAAATGCCACCAATAGCGTTGGAAGAGCTAC 
TTCGACTGCTGAATTACTGGTTCAAGGTGAAGAAGAAGTACCTGCTAAAAAGACAAAGAC 
AATTGTTTCGACTGCTCAGATCTCAGAATCAAGACAAACCCGAATTGAAAAGAAGATTGA 
AGCCCACTTTGATGCCAGATCAATTGCAACAGTTGAGATGGTCATAGATGGTGCCGCTGG 
GCAACAGCTGCCACATAAAACACCTCCCAGGATTCCTCCGAAGCCAAAGTCAAGATCCCC 
AACACCACCGTCTATTGCTGCCAAAGCACAGCTGGCTCGGCAGCAGTCCCCATCGCCCAT 
AAGACACTCCCCTTCCCCGGTCAGACACGTGCGGGCACCGACCCCATCTCCGGTCAGGTC 
CGTGTCTCCAGCAGCAAGAATCTCCACATCCCCCATCAGGTCTGTTAGGTCTCCATTGCT 
CATGCGTAAGACTCAGGCATCCACCGTGGCCACAGGTCCTGAAGTGCCTCCCCCTTGGAA 
GCAAGAGGGCTACGTGGCCTCCTCATCTGAGGCTGAGATGAGAGAGACAACGCTGACAAC 
CTCTACTCAGATCAGGACAGAAGAGAGATGGGAAGGGAGATACGGTGTCCAGGAGCAAGT 
GACCATCAGTGGTGCTGCGGGTGCTGCCGCCAGTGTGTCGGCCAGTGCTAGCTACGCAGC 
AGAGGCTGTTGCCACTGGTGCTAAAGAGGTGAAACAAGATGCTGACAAAAGTGCAGCTGT 
TGCGACTGTTGTTGCTGCCGTTGATATGGCCAGAGTGAGAGAACCAGTGATCAGCGCTGT 
AGAGCAGACTGCTCAGAGGACAACCACGACTGCTGTGCACATCCAACCTGCTCAAGAACA 
GGTAAGAAAGGAAGCGGAGAAGACTGCTGTAACTAAGGTAGTAGTGGCCGCCGATAAAGC 
CAAGGAACAAGAATTAAAATCAAGAACCAAAGAAGTAATTACCACAAAGCAAGAGCAGAT 
GCACGTAACTCATGAGCAGATAAGAAAAGAAACTGAAAAAACATTTGTACCAAAGGTAGT 
AATTTCCGCAGCTAAAGCCAAAGAACAAGAAACTAGAATTTCTGAAGAAATTACTAAGAA 
ACAGAAACAAGTAACTCAAGAAGCAATAAGACAGGAAACTGAGATAACTGCTGCATCCAT 
GGTGGTAGTTGCCACTGCAAAGTCCACAAAACTAGAAACAGTCCCGGGAGCTCAAGAAGA 
AACTACCACACAACAAGATCAAATGCACCTAAGTTATGAAAAGATAATGAAGGAAACTAG 
GAAAACAGTTGTACCTAAAGTCATAGTTGCCACACCCAAAGTCAAAGAACAAGATTTAGT 
>ENST00000436599.1 cdna:known chromosome:GRCh38:2:178786089:178794954:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
>ENST00000470257.1 cdna:known chromosome:GRCh38:2:178798495:178807408:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:retained_intron gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
>ENST00000412264.1 cdna:known chromosome:GRCh38:2:178802287:178830802:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
GCAGTCGTGCATTCCCAGCCTCGCCTCGGGTGTAGGGATTGCATAGAAAAGCAAAACTAC 
ACAGTCTTGACTGTGTAGTTTTGTTTTTAGGATTAGAGGCTCACCGATTCATGTCGGAGA 
TGGTCAGAAAAACCAACTCTCCATAGGACGTCGTTTCAGAAGCAACCTTGGGCTTAGTCC 
CACCCTTTTTAGGCACTCTTGAGAAATCAGAGTGCCTAGAAAGATGACAACTCAAGCACC 
GACGTTTACGCAGCCGTTACAAAGCGTTGTGGTACTGGAGGGTAGTACCGCAACCTTTGA 
GGCTCACATTAGTGGTTTTCCAGTTCCTGAGGTGAGCTGGTTTAGGGATGGCCAGGTGAT 
TTCCACTTCCACTCTGCCCGGCGTGCAGATCTCCTTTAGCGATGGCCGCGCTAAACTGAC 
GATCCCCGCCGTGACTAAAGCCAACAGTGGACGATATTCCCTGAAAGCCACCAATGGATC 
TGGACAAGCGACTAGTACTGCTGAGCTTCTCGTGAAAGCTGAGACAGCACCACCCAACTT 
CGTTCAACGACTGCAGAGCATGACCGTGAGACAAGGAAGCCAAGTGAGACTCCAAGTGAG 
AGTGACTGGAATCCCTACACCTGTGGTGAAGTTCTACCGGGATGGAGCCGAAATCCAGAG 
CTCCCTTGATTTCCAAATTTCACAAGAAGGCGACCTCTACAGCTTACTGATTGCAGAAGC 
ATACCCTGAGGACTCAGGGACCTATTCAGTAAATGCCACCAATAGCGTTGGAAGAGCTAC 
TTCGACTGCTGAATTACTGGTTCAAGGTGAAGAAGAAGTACCTGCTAAAAAGACAAAGAC 
AATTGTTTCGACTGCTCAGATCTCAGAATCAAGACAAACCCGAATTGAAAAGAAGATTGA 
AGCCCACTTTGATGCCAGATCAATTGCAACAGTTGAGATGGTCATAGATGGTGCCGCTGG 
GCAACAGCTGCCACATAAAACACCTCCCAGGATTCCTCCGAAGCCAAAGTCAAGATCCCC 
AACACCACCGTCTATTGCTGCCAAAGCACAGCTGGCTCGGCAGCAGTCCCCATCGCCCAT 
AAGACACTCCCCTTCCCCGGTCAGACACGTGCGGGCACCGACCCCATCTCCGGTCAGGTC 
CGTGTCTCCAGCAGCAAGAATCTCCACATCCCCCATCAGGTCTGTTAGGTCTCCATTGCT 
CATGCGTAAGACTCAGGCATCCACCGTGGCCACAGGTCCTGAAGTGCCTCCCCCTTGGAA 
GCAAGAGGGCTACGTGGCCTCCTCATCTGAGGCTGAGATGAGAGAGACAACGCTGACAAC 
CTCTACTCAGATCAGGACAGAAGAGAGATGGGAAGGGAGATACGGTGTCCAGGAGCAAGT 
GACCATCAGTGGTGCTGCGGGTGCTGCCGCCAGTGTGTCGGCCAGTGCTAGCTACGCAGC 
AGAGGCTGTTGCCACTGGTGCTAAAGAGGTGAAACAAGATGCTGACAAAAGTGCAGCTGT 
TGCGACTGTTGTTGCTGCCGTTGATATGGCCAGAGTGAGAGAACCAGTGATCAGCGCTGT 
AGAGCAGACTGCTCAGAGGACAACCACGACTGCTGTGCACATCCAACCTGCTCAAGAACA 
GGTAAGAAAGGAAGCGGAGAAGACTGCTGTAACTAAGGTAGTAGTGGCCGCCGATAAAGC 
CAAGGAACAAGAATTAAAATCAAGAACCAAAGAAGTAATTACCACAAAGCAAGAGCAGAT 
GCACGTAACTCATGAGCAGATAAGAAAAGAAACTGAAAAAACATTTGTACCAAAGGTAGT 
AATTTCCGCAGCTAAAGCCAAAGAACAAGAAACTAGAATTTCTGAAGAAATTACTAAGAA 
ACAGAAACAAGTAACTCAAGAAGCAATAAGACAGGAAACTGAGATAACTGCTGCATCCAT 
GGTGGTAGTTGCCACTGCAAAGTCCACAAAACTAGAAACAGTCCCGGGAGCTCAAGAAGA 
AACTACCACACAACAAGATCAAATGCACCTAAGTTATGAAAAGATAATGAAGGAAACTAG 
GAAAACAGTTGTACCTAAAGTCATAGTTGCCACACCCAAAGTCAAAGAACAAGATTTAGT 
>ENST00000359218.9 cdna:known chromosome:GRCh38:2:178525989:178807423:-1 gene:ENSG00000155657.24 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:TTN description:titin [Source:HGNC Symbol;Acc:HGNC:12403] 
GCAGTCGTGCATTCCCAGCCTCGCCTCGGGTGTAGGGATTGCATAGAAAAGCAAAACTAC 
ACAGTCTTGACTGTGTAGTTTTGTTTTTAGGATTAGAGGCTCACCGATTCATGTCGGAGA 
TGGTCAGAAAAACCAACTCTCCATAGGACGTCGTTTCAGAAGCAACCTTGGGCTTAGTCC 
CACCCTTTTTAGGCACTCTTGAGAAATCAGAGTGCCTAGAAAGATGACAACTCAAGCACC 
GACGTTTACGCAGCCGTTACAAAGCGTTGTGGTACTGGAGGGTAGTACCGCAACCTTTGA 
GGCTCACATTAGTGGTTTTCCAGTTCCTGAGGTGAGCTGGTTTAGGGATGGCCAGGTGAT 
TTCCACTTCCACTCTGCCCGGCGTGCAGATCTCCTTTAGCGATGGCCGCGCTAAACTGAC 
GATCCCCGCCGTGACTAAAGCCAACAGTGGACGATATTCCCTGAAAGCCACCAATGGATC 
TGGACAAGCGACTAGTACTGCTGAGCTTCTCGTGAAAGCTGAGACAGCACCACCCAACTT 
CGTTCAACGACTGCAGAGCATGACCGTGAGACAAGGAAGCCAAGTGAGACTCCAAGTGAG 
AGTGACTGGAATCCCTACACCTGTGGTGAAGTTCTACCGGGATGGAGCCGAAATCCAGAG 
CTCCCTTGATTTCCAAATTTCACAAGAAGGCGACCTCTACAGCTTACTGATTGCAGAAGC 
ATACCCTGAGGACTCAGGGACCTATTCAGTAAATGCCACCAATAGCGTTGGAAGAGCTAC 
TTCGACTGCTGAATTACTGGTTCAAGGTGAAGAAGAAGTACCTGCTAAAAAGACAAAGAC 
AATTGTTTCGACTGCTCAGATCTCAGAATCAAGACAAACCCGAATTGAAAAGAAGATTGA 
AGCCCACTTTGATGCCAGATCAATTGCAACAGTTGAGATGGTCATAGATGGTGCCGCTGG 
GCAACAGCTGCCACATAAAACACCTCCCAGGATTCCTCCGAAGCCAAAGTCAAGATCCCC 
AACACCACCGTCTATTGCTGCCAAAGCACAGCTGGCTCGGCAGCAGTCCCCATCGCCCAT 
AAGACACTCCCCTTCCCCGGTCAGACACGTGCGGGCACCGACCCCATCTCCGGTCAGGTC 
CGTGTCTCCAGCAGCAAGAATCTCCACATCCCCCATCAGGTCTGTTAGGTCTCCATTGCT 
CATGCGTAAGACTCAGGCATCCACCGTGGCCACAGGTCCTGAAGTGCCTCCCCCTTGGAA 
GCAAGAGGGCTACGTGGCCTCCTCATCTGAGGCTGAGATGAGAGAGACAACGCTGACAAC 
CTCTACTCAGATCAGGACAGAAGAGAGATGGGAAGGGAGATACGGTGTCCAGGAGCAAGT 
GACCATCAGTGGTGCTGCGGGTGCTGCCGCCAGTGTGTCGGCCAGTGCTAGCTACGCAGC 
AGAGGCTGTTGCCACTGGTGCTAAAGAGGTGAAACAAGATGCTGACAAAAGTGCAGCTGT 
TGCGACTGTTGTTGCTGCCGTTGATATGGCCAGAGTGAGAGAACCAGTGATCAGCGCTGT 
AGAGCAGACTGCTCAGAGGACAACCACGACTGCTGTGCACATCCAACCTGCTCAAGAACA 
GGTAAGAAAGGAAGCGGAGAAGACTGCTGTAACTAAGGTAGTAGTGGCCGCCGATAAAGC 
CAAGGAACAAGAATTAAAATCAAGAACCAAAGAAGTAATTACCACAAAGCAAGAGCAGAT 
GCACGTAACTCATGAGCAGATAAGAAAAGAAACTGAAAAAACATTTGTACCAAAGGTAGT 
AATTTCCGCAGCTAAAGCCAAAGAACAAGAAACTAGAATTTCTGAAGAAATTACTAAGAA 
ACAGAAACAAGTAACTCAAGAAGCAATAAGACAGGAAACTGAGATAACTGCTGCATCCAT 
GGTGGTAGTTGCCACTGCAAAGTCCACAAAACTAGAAACAGTCCCGGGAGCTCAAGAAGA 
AACTACCACACAACAAGATCAAATGCACCTAAGTTATGAAAAGATAATGAAGGAAACTAG 
GAAAACAGTTGTACCTAAAGTCATAGTTGCCACACCCAAAGTCAAAGAACAAGATTTAGT 

:

informationfile.txt과 유사한 헤더를 수집

내가 쓴
ENST00000342992.10 
ENST00000460472.6 
ENST00000589042.5 
ENST00000591111.5 
ENST00000359218.9 
ENST00000615779.4 
ENST00000342175.10 

AWK 코드를 타겟팅하고 싶은, 그리고 다음 헤더까지 다음 정보와 함께 헤더를 수집하십시오.

#!/bin/awk      
NR == FNR {tags[$1]; next;} 
for (i in tags) { if (i ~ $0) {a=1; print; next;}} 
/>/ {a=0} 
a 

그것은 생산한다 : 아래

awk -f myScript.txt <headerlist.txt> <informationfile.txt> 

코드입니다 : 그러나

>Target Header 
Information attached to header 
. 
. 
. 

, 나는 구문 오류를 얻고있다


나는 그것을 호출 정보가 없다. 화살표는 공백 만있는 문자를 가리 키지 않습니다.

^ Syntax Error 

어떻게 수정하나요?

+1

중괄호 안에'for (i in tags) '를 이동하십시오. – karakfa

+0

실행 중입니다. :) - 그러나 출력이 나오지 않습니다. –

+1

글쎄, 구문 오류가 수정되었습니다. 스크립트에 다른 문제가 있습니다. 예를 들어'a '의 용도는 무엇입니까? – karakfa

답변

1

입력

$ cat HeaderList 
Target Header 
SomeOther Header 

$ cat InfoFile 
>Generic Header 
Information attached to header 
. 
. 
. 
>Target Header 
Information attached to header 
. 
. 
. 
>SomeOther Header 
Information attached to header 
. 
. 
. 

스크립트

while read line 
    do 
    awk 'BEGIN{RS="\n>"}/'"$line"'/{printf ">%s\n",$0}' InfoFile 
    done <HeaderList 

는 출력

>Target Header 
Information attached to header 
. 
. 
. 
>SomeOther Header 
Information attached to header 
. 
. 
. 
+0

"대상 머리글"을 여러 번 사용하려면 어떻게해야합니까? –

+1

@NicholasHayden : 업데이트되었습니다. – sjsam

+0

감사! 그것은 효과가 있었다. 정말 고맙습니다! –

1

나는이 w를 생각한다

$ awk 'NR==FNR{h[$0]; next} 
     $0 in h{c=2} 
     c&&c--' headers file 

>Target Header 
Information attached to header 

헤더가 정확히 동일하면 등호 검사 ($ 0 in h)와 일치시켜 두 줄을 인쇄 할 수 있습니다. 당신이 다음 헤더이 스크립트가 필요로하는 새로운 파일 레이아웃으로

$ awk 'NR==FNR{h[$0]; next} 
      /^>/{p=0} 
     $0 in h{p=1} 
       p' headers file 

>Target Header 
Information attached to header 
. 
. 
. 

까지 인쇄 할 경우 사이에 공백이있는 한

한 같은

$ awk 'NR==FNR{h[">"$0]; next} 
      /^>/{p=0} 
     $1 in h{p=1} 
       p' headers file 

로 수정합니다 키 (헤더 파일에서 사용)와 나머지 레코드가 작동해야합니다. 이제 헤더에는 접두사 ">"가 없습니다.

+0

코드를 실행했지만 출력이 나오지 않습니다. 평등 점검에 대해 자세히 설명해 주시겠습니까? –

+1

입력 파일을 정확히 사용하고 "> 대상 헤더"가 포함 된 헤더 파일을 사용했습니다. 이것은 파일의 어떤 행이'h' (헤더 파일에서 채워져 있는지)에 있는지 검사합니다. 테스트를 위해 질문에서 텍스트를 파일로 복사하고'echo "> Target Header"> headers'를 사용하여 헤더 파일을 생성하고'awk' 스크립트를 실행하십시오. – karakfa

+0

출력을 복제하려고하는데 성공하지 못했습니다. 제공된 입력을 사용하고 코드에 정확하게 적용하려고했습니다. –