Open Source Tools (GPL-3.0)
-
BIGBIOCL:
is a software tool based on Apache Spark MLlib for the efficient analysis of DNA methylation
large datasets. As an example, BIGBIOCL can be
used with DNA methylation data for the identification of cancer drivers. It relies on Apache Spark
MLlib and can be executed
both on a single machine and on an Apache Hadoop YARN cluster. BigBioCL is described in the
scientific article Classification of Large DNA Methylation Datasets for Identifying Cancer
Drivers
-
AgroTagger is
a command line application for web documents indexing, creating RDF triples that link a web URL to
some
URIs of a SKOS thesaurus.
This application can be used together with a web crawler: the crawler discovers URLs, the
AgroTagger assigns AGROVOC
URIs to those URLs. In addition to that, titles and descriptions of Web resources can be
extracted.
-
SemaGrow Recommender System is
a software component that computes meaningful combinations between some datasets federated by
SemaGrow,
and generates a new triplestore: the "Recommender Database". Recommender System was funded by
SemaGrow
FP7 EU Project. It computes meaningful combinations between two or more datasets federated by
SemaGrow:
the computation of combinations is based on the matching of AGROVOC URIs between datasets.
Research and University Projects
Java Libraries
-
MergeXmlFiles
Java application that allows merging a set of UTF-8 XML files having the same root element.
-
JFCUtil:
Libreria Java di classi di utilità per lavorare con i files e con le stringhe (lettura,
scrittura, zip, cancellazione, filtri...).
-
RDFBulkUpload
Libreria che consente di inviare molteplici files RDF/XML ad un triplestore (Sesame, OWLIM,
Allegrograph, Virtuoso).