Skip to content

A Markov Model DNA sequence generator to generate pseudo-replicate sequences based on an input sequence

License

Notifications You must be signed in to change notification settings

ErillLab/Markov_DNA_gen

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Markov_DNA_gen

A Markov Model DNA sequence generator to generate pseudo-replicate sequences based on an input sequence.

Requirements

  • NumPy

Installation

It can be installed using pip:

pip3 install .

Or using conda:

conda install erilllab::markov_dna

User guide

Import module:

    from Markov_DNA import MCM

Train model:

    mcm = MCM(k)
    mcm.train(seq)

where:

  • k is the model order
  • seq is the reference sequence to train the model

Generate sequences:

Can be done in two different ways, using a generator in an iterative way:

    for new_seq in mcm.generator(size=l, N=N):
        print(new_seq)

Or by calling a function that generates a list of sequences:

    seqs = mcm.generate(size=l, N=N)

Where:

  • l is the length of the sequence to be generated.
  • N is the number of sequences to be generated.

The advantage of the first method is that you do not need to keep all the sequences in memory, while the second one allows you to obtain a list of sequences directly.

Example

There are some examples located in the examples folder. To execute them, the following command can be used:

    python3 examples/exampleX.py

Where X can be [1..2] depending on the genereting method.

  • Example 1: Shows how to use the function that returns the list of sequences.
  • Example 2: Shows how to use the iterative form of the generator generator.

Authors

Erik Blázquez Fernández (erikblazfer@outlook.es)

About

A Markov Model DNA sequence generator to generate pseudo-replicate sequences based on an input sequence

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages