Supporting Information S1. The benchmark dataset consists of a positive dataset and a negative dataset. The positive dataset contains 5,000 nucleosome-forming DNA segments, while the negative dataset contains 5,000 nucleosome-inhibiting DNA segments. Each of these segments is 150 bp long.

I. The positive dataset contains the following 5,000 nucleosome-forming DNA sequences, each 150 bp long:

GCCCAGAACGAAGAAATCAGTGCCACGCCAACTCCAAATCCAGAAAGCAGCGCAGGTGCAGATGACACTTC CAGAGAAGCAAGTGCAAGTGCTGAAGGTGCTGAGGCCATTGAAGGCGACTTCATGTCTACTTTGAAGCAAT CGAAGAAG
AAGAAATCAGTGCCACGCCAACTCCAAATCCAGAAAGCAGCGCAGGTGCAGATGACACTTCCAGAGAAGCA AGTGCAAGTGCTGAAGGTGCTGAGGCCATTGAAGGCGACTTCATGTCTACTTTGAAGCAATCGAAGAAGAA GCAAGAAA
CCATGAGATGTCAGGAATTGTTTCCAAGGTTGGTCCTAAAGTGACAAAGGTGAAGGTTGGCGACCACGTGG TCGTTGATGCTGCCAGCAGTTGTGCGGACCTGCATTGCTGGCCACACTCCAAATTTTACAATTCCAAACCA TGTGATGC
TCAGGAATTGTTTCCAAGGTTGGTCCTAAAGTGACAAAGGTGAAGGTTGGCGACCACGTGGTCGTTGATGC TGCCAGCAGTTGTGCGGACCTGCATTGCTGGCCACACTCCAAATTTTACAATTCCAAACCATGTGATGCTT GTCAGAGG
GCCATGGAACGACACTTTTACCTCTACTTCTACCGAATTGACCACAGTCACCGGTACCAATGGTTTGCCAA CTGATGAGACCATCATTGTCATCAGAACACCAACAACAGCCACTACTGCCATGACTACAACTCAGCCATGG AACGACAC
CGTCACCGGTACCAACGGCGTTCCAACTGACGAAACCGTCATTGTCATCAGAACTCCAACCAGTGAAGGTC TAATCAGCACCACCACTGAACCATGGACTGGCACTTTCACTTCGACTTCCACTGAGGTTACCACCATCACT GGAACCAA
GTCCTGAAGATGACGAAGATGAATTGATGGACGACGTTATGGATGATTTGACTGGTTTGTTGGACTCCGTT GACACAACTGGTAAAGGTGTTGTGGTCCAAGCATCCACCTTGGGTTCTTTGGAAGCTTTGTTGGATTTCTT GAAAGACA
GTTTGGAAGAACCATTCTTCATTGAGCCATTCAATGATCAGACTGACACGTTGCTCGAAATCCTGGATGAA GAAGCCAAGCAGTTCTTCACGAATCAGGTCACTGGCCTCTTGTGCTTCGATTCCTCTCGTAACCAATCTGA TTAAGACG
AAATGACGCACGTCACCGGTACCAACGGCGTTCCAACTGACGAAACCGTCATTGTCATCAGAACTCCAACC AGTGAAGGTCTAATCAGCACCACCACTGAACCATGGACTGGCACTTTCACTTCGACTTCCACTGAGGTTAC CACCATCA
TAACAATCACGGTTTCGTCAGTTGGTTGACCGTTAGTACCGGTGACGGTGGTCATCTCAGTGGATGTAGAG GTGAAAGTACCAGTCCATGGTTCAGTGGTGGTGCTGATTAGACCTTCACTAGTTGGAGTTCTGATGACAAT GACGGTTT
GCCATGGAACGACACTTTTACCTCTACATCCACTGAAATCACCACCGTCACCGGTACCAATGGTTTGCCAA CTGATGAGACCATCATTGTCATCAGAACACCAACAACAGCCACTACTGCCATGACTACACCTCAGCCATGG AACGACAC
ATCACTTTATCGTGCATCTTGACCACGTTATTTCTGCTGGTGAACGAGTGGGGACAGTTCAATTCTGTGGT AACAAGGCCACAATTGGTGGTGGACCGTGACCGACACGCAAAGCTGGAGCTTAATATGGATGTGACATTTC CATCGATG