Vân Anh Huynh-Thu

GENIE3 is a machine learning-based approach for the inference of gene regulatory networks from expression data.

Related paper:
Inferring regulatory networks from expression data using tree-based methods
Huynh-Thu, V. A., Irrthum, A., Wehenkel, L., and Geurts, P.
PLoS ONE, 5(9):e12776, 2010.

Four implementations of GENIE3 are available: Python, MATLAB, R/C, and R/randomForest.
Note: All the results presented in the PLoS ONE paper were generated using the MATLAB implementation.

GENIE3 is based on regression trees. To learn these trees, the Python implementation uses the scikit-learn library. The MATLAB and R/C implementations are MATLAB and R wrappers, respectively, around C code written by Pierre Geurts. The R/randomForest implementation uses the randomForest R package.
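To illustrate how regression trees can be used for network inference with scikit-learn, here is a minimal sketch. It is not the distributed Python implementation: the function name `genie3_sketch`, its parameters, and the toy data are hypothetical. It assumes the approach described in the related paper, where each gene in turn is predicted from all other genes with a tree ensemble and the resulting feature importances are used as scores for candidate regulatory links. It also relies on scikit-learn's importances, which are normalized to sum to one per target gene, so the scores may be scaled differently from those of the official code.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor


def genie3_sketch(expr, n_trees=100, seed=0):
    """Score candidate regulatory links from an expression matrix.

    expr: array of shape (n_samples, n_genes).
    Returns an (n_genes, n_genes) matrix W where W[i, j] is the
    importance of gene i for predicting gene j (diagonal is zero).
    Hypothetical helper for illustration only.
    """
    n_samples, n_genes = expr.shape
    weights = np.zeros((n_genes, n_genes))
    for target in range(n_genes):
        # Every gene except the target is a candidate regulator.
        regulators = [g for g in range(n_genes) if g != target]
        forest = RandomForestRegressor(
            n_estimators=n_trees, max_features="sqrt", random_state=seed
        )
        forest.fit(expr[:, regulators], expr[:, target])
        # scikit-learn importances sum to 1 for each fitted forest.
        weights[regulators, target] = forest.feature_importances_
    return weights


# Toy data (an assumption for the demo): gene 2 is driven by gene 0.
rng = np.random.default_rng(0)
expr = rng.normal(size=(200, 5))
expr[:, 2] = 2.0 * expr[:, 0] + 0.1 * rng.normal(size=200)

weights = genie3_sketch(expr, n_trees=50)
top_regulator_of_gene2 = int(np.argmax(weights[:, 2]))
print("Highest-scoring candidate regulator of gene 2:", top_regulator_of_gene2)
```

Ranking all entries of the returned matrix from highest to lowest gives an ordered list of putative regulatory links. Restricting `regulators` to a list of known transcription factors would be a natural extension of this sketch.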
The R/C implementation is the fastest of the GENIE3 implementations. It was developed for the SCENIC pipeline to analyze single-cell RNA-seq data (Aibar Santos et al., 2017; manuscript accepted in Nature Methods).
The running times of the different GENIE3 implementations are shown below for the DREAM5 networks (in each case, GENIE3 was run using the default parameters). These computing times were measured on a computer with an Intel Xeon E5520 2.27 GHz processor and 16 GB of RAM.
[Figure: GENIE3 running times]


GENIE3 was the best performer in two DREAM challenges: