FAQs

From Jstacs
Revision as of 12:32, 7 September 2008 by Grau (talk | contribs)
Jump to navigationJump to search

Handling data

Q: How do I create an AlphabetContainer instance for DNA sequences?
A: AlphabetContainer container = new AlphabetContainer( new DNAAlphabet() );


Q: Why shall I use the AlphabetContainer and not just a simple Alphabet instance?
A: Because for some data you will not have the same alphabet at each position of the sequence, e.g. when using phenotypic data. Hence, we also strongly recommend to always use getAlphabetLengthAt(int) when setting e.g. the size of an array.


Q: How can I create a simple sequence?
A: Try to use the create method of Sequence, e.q. Sequence.create( new AlphabetContainer( new DNAAlphabet() ), "ACGTACGT" );


Q: How can I load my own data?
A: If your sequences are stored either in plain text or in FastA format, you can directly create a new Sample from the file.


Q: I wrote some sophisticated method using BioJava to load my data from a Genbank file/a database/somewhere else. How can I do something similar in Jstacs?
A: You can still use your existing method. Jstacs has an adapter for BioJava SequenceIterators.


Using existing models

Q: Where do I find a list of the models currently implemented in Jstacs?
A: All generative models in Jstacs implement the Model interface.
All discriminative models in Jstacs implement the ScoringFunction interface. You find all the existing implementations in the list of implementing classes of these two interfaces.


Q: I decided for two Models. How do I learn them and classify new data?
A: You can create a new ModelBasedClassifier from your models and use its train and classify methods. If you only want to learn a model from data, e.g. to sample new sequences, you can also directly use the train method of the Model.


Q: I decided for two ScoringFunctions. How do I learn them and classify new data?
A: You can create a new ScoreClassifier from your scoring functions and use its train and classify methods.


Q: How can store and load my model, classifier, ...?
A: All classes that implement Storable have a method toXML() that returns a StringBuffer containing the instance as XML. Such classes should also have a proper constructor with a single argument StringBuffer. This can be used to create a new instance form a StringBuffer that contains an instance as XML. In addition, The class FileManger allows to read and write StringBuffers to the hard drive.

Q: Why does Jstacs use XML to save instances?
A: Because it is human-readable.


Implementing new Models

Q: How do I implement a new generative Model?
A: Write an implementation of the Model interface. For convenience, you can use the abstract AbstractModel class with default implementations for many methods.


Q: How do I implement a new discriminative model?
A: Write an implementation of the ScoringFunction interface. For convenience, you can use the abstract AbstractNormalizableScoringFunction class with default implementations for many methods.


Q: How do I implement a model that can be trained generatively and discriminatively??
A: You can either extend AbstractModel and additionally implement the ScoringFunction interface, or you extend the AbstractNormalizableScoringFunction and additionally implement the [http://www.jstacs.de/api/de/jstacs/models/Model.html Model interface.

Q: The class UserTime does not work! Why?
A: The class UserTime uses native code. Therefore there are at least two possibilities:
A1: You have forgotten to set the Java library path: -Djava.library.path=...
A2: You have to compile the native code on your system.