Journal ArticleOpen Access
PyFeat: a Python-based effective feature generation tool for DNA, RNA and protein sequences
Authors
Author Affiliations
United International University, Griffith University, RIKEN Center for Integrative Medical Sciences, University of the South Pacific, ...
Published InBioinformatics
Year2019
Citations120
Abstract
MOTIVATION: Extracting useful feature set which contains significant discriminatory information is a critical step in effectively presenting sequence data to predict structural, functional, interaction and expression of proteins, DNAs and RNAs. Also, being able to filter features with significant information and avoid sparsity in the extracted features require the employment of efficient feature selection techniques. Here we present PyFeat as a practical and easy to use toolkit implemented in Python for extracting various features from proteins, DNAs and RNAs. To build PyFeat we mainly focused on extracting features that capture information about the interaction of neighboring residues to be able to provide more local information. We then employ AdaBoost technique to select features with maximum discriminatory information. In this way,…
View at Publisher
BORR does not host full-text PDFs. The button above takes you to the original publisher.