BIOINFORMATICS

The Bioinformatics group is a team of professional scientists focusing on the study, application, development, and optimization of tools for the analysis of genomic and biological data generated by INGM scientists and collaborators. The facility works closely with the Institute’s researchers and gives all research groups access to biological data analysis and processing. It supports basic and translational research with both standard and customized analyses, and it provides and facilitates access to up-to-date as well as novel analytical methods.

Our team works in the Institute’s bioinformatics open space in close contact with other bioinformatics researchers and graduate students. Our team members are either dedicated to single projects or collaborate on multiple ones, according to the Institute’s needs, workloads, and the requirements of principal investigators. Our background is broad and covers biology, systems biology, computer science, and biostatistics; our multidisciplinary nature gives us a fresh and systemic view of data and their biomedical and clinical context.
The IT staff provides us with access to INGM’s in-house, state-of-the-art high-performance computing infrastructure and connectivity.

Activity

Support on experimental design for data-intensive projects, data cleaning and data exploration.
Medium- to high-throughput gene expression profiling: from RT-qPCR arrays to microarrays.
Next-generation sequencing analyses: RNA sequencing, whole-exome sequencing, custom panels, ChIP sequencing.
Analysis of non-coding RNA data, including cellular and circulating microRNAs.
Multivariate analyses for transcriptomics, genomics and proteomics; feature selection, biomarker prioritization, descriptive and inferential biostatistics.
Functional analyses for biological contextualization, gene ontology, methods for pathway analyses.
Advanced functional analyses based on pathway impact and network metrics.
Design and development of software applications for computational biology and genomics.
Training for students and interns as developing bioinformaticians.
Internal training in general data literacy and scripting principles, aimed at all INGM researchers.

Team

Unit Coordinator

Prof. Beatrice Bodega, PhD

Staff

Name	Role	Email
Valeria Ranzani, PhD	Research Scientist	ranzani@ingm.org
Eugenia Galeota, PhD	Research Scientist	galeota@ingm.org

Affiliated members

Name	Role	Affiliated lab	Email
Andrea Gobbini	Researcher	Grifantini	gobbini@ingm.org
Ivan Ferrari	Researcher	Biffo	ferrari@ingm.org
Riccardo Nodari	Researcher	De Francesco	nodari@ingm.org
Benedetto Polimeni	PhD student	Bodega	polimeni@ingm.org
Lorenzo Salviati	PhD student	Bodega	salviati@ingm.org
Emanuele Di Patrizio Soldateschi	PhD student	Lanzuolo	soldateschi@ingm.org
Mattia Battistella	PhD student	Cattaneo	battistella@ingm.org
Francesca Vincenti	Research fellow	Abrignani	vincenti@ingm.org
Gianluca Damaggio	Research fellow	Cattaneo	gianluca.damaggio@unimi.it
Michele Panepuccia	Research fellow	Bodega	panepuccia@ingm.org
Isidora Bijelović	Master’s student	Bodega	bijelovic@ingm.org
Alen Stambolliu	Master’s student	Bodega	stambolliu@ingm.org
Carola Miuccio	Second-level Master’s intern	Manganaro	miuccio@ingm.org

Equipment

INGM bioinformaticians rely on an in-house high-performance computing (HPC) cluster with more than 300 CPUs, 1.5 TB of RAM and about 100 TB of disk storage. The infrastructure was deployed and is maintained by the Information Technology personnel in collaboration with the bioinformatics group. The whole infrastructure is wired with high-speed connectivity and protected by security and backup systems.

Computational activities are performed on the HPC cluster and managed by the Torque/PBS queue system on a series of virtual machines (VMs). Both the HPC cluster and the VMs run the Ubuntu Linux operating system. Minor computational tasks can also be performed locally on personal workstations (Xeon PCs, Windows OS) and/or laptops (Windows/macOS), which are also used as HPC clients.
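As a rough illustration of how work might be handed to a Torque/PBS queue such as the one mentioned above, the following Python sketch assembles a submission script as plain text. The job name, resource values and command are hypothetical placeholders and do not reflect INGM settings; the `#PBS` directives used (`-N`, `-l nodes=...:ppn=...`, `-l walltime=...`, `-l mem=...`) and the `PBS_O_WORKDIR` variable are standard Torque features.

```python
def build_pbs_script(job_name, command, nodes=1, cores_per_node=4,
                     walltime="01:00:00", memory="8gb"):
    """Return the text of a Torque/PBS submission script.

    All default resource values are illustrative assumptions,
    not the configuration of any real cluster.
    """
    lines = [
        "#!/bin/bash",
        f"#PBS -N {job_name}",                          # job name
        f"#PBS -l nodes={nodes}:ppn={cores_per_node}",  # nodes and cores per node
        f"#PBS -l walltime={walltime}",                 # maximum run time
        f"#PBS -l mem={memory}",                        # requested memory
        # Torque starts jobs in the home directory, so move to the
        # directory from which the job was submitted.
        'cd "$PBS_O_WORKDIR"',
        command,
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical job: the command is a placeholder.
    print(build_pbs_script("example_job", "echo 'analysis goes here'"))
```

The resulting text would typically be saved to a file and submitted with `qsub`.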

Applications / Software

combiroc (R package; available on GitHub and CRAN)
The combiroc R package is our most recent implementation of the CombiROC method, introducing additional functions for the automatic selection and optimization of gene signatures, also in the context of single-cell RNA sequencing experiments.
CombiROC (http://www.combiroc.eu)
CombiROC is a web application for guided and interactive generation of multimarker panels.
myVCF (http://myvcf.readthedocs.io/en/latest/)
myVCF is an application for high-throughput mutation data management that handles multiple sequencing projects created from VCF files; it allows end users without strong programming or bioinformatics skills to explore, query, visualize and export mutation data in a simple and straightforward way.
miRiadne
miRiadne is a tool for re-annotating miRNA name lists or datasets. Obsolete annotations (due either to older miRBase versions or to outdated profiling platforms) can be converted into newer ones while enforcing mature sequence correspondence. This project is no longer maintained and the application is no longer available; for any enquiry, please contact the paper’s main author (see below).
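The idea of converting obsolete miRNA names through mature sequence correspondence can be sketched in a few lines of Python. This is only a minimal illustration of the principle, not miRiadne’s actual implementation; the function name, the toy names and the toy sequences are all made up for the example.

```python
def reannotate(names, old_annotation, new_annotation):
    """Map old miRNA names to new names through their mature sequences.

    old_annotation: dict of old name -> mature sequence
    new_annotation: dict of new name -> mature sequence
    Returns a dict of old name -> new name (None when no match is found).
    """
    # Invert the new annotation so names can be looked up by sequence.
    sequence_to_new = {seq: name for name, seq in new_annotation.items()}
    result = {}
    for name in names:
        seq = old_annotation.get(name)
        # A name is converted only if its mature sequence exists in the new release.
        result[name] = sequence_to_new.get(seq) if seq is not None else None
    return result


if __name__ == "__main__":
    # Toy data: names and sequences are invented placeholders.
    old = {
        "toy-miR-1": "AAAACCCCGGGG",
        "toy-miR-1*": "UUUUGGGGCCCC",
        "toy-miR-2": "ACGUACGUACGU",
    }
    new = {
        "toy-miR-1-5p": "AAAACCCCGGGG",
        "toy-miR-1-3p": "UUUUGGGGCCCC",
    }
    print(reannotate(["toy-miR-1", "toy-miR-1*", "toy-miR-2"], old, new))
```

Names whose sequence has no counterpart in the newer annotation are returned as `None`, so they can be flagged for manual review instead of being silently renamed.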

Publications

[preprint] Combinatorial selection of biomarkers to optimize gene signatures in diagnostics and single cell applications
Ferrari I., Mazzara S., Abrignani S., Grifantini R., Bombaci M., Rossi R.L.
bioRxiv 2022.01.17.476603 (2022)
High inter-follicular spatial co-localisation of CD8+FOXP3+ with CD4+CD8+ cells predicts favourable outcome in follicular lymphoma
Hagos Y.B., Akarca A.U., Ramsay A., Rossi R.L., Pomplun S., Moioli A., Gianatti A., Rambaldi A., Quezada S.A., Linch D., Gritti G., Yuan Y., Marafioti T.
Hematological Oncology, April 22 (2022)
Novel interferon-sensitive genes unveiled by correlation-driven gene selection and systems biology
Cheroni C., Manganaro L., Donnici L., Bevilacqua V., Bonnal RJP., Rossi RL, De Francesco R.
Scientific Reports 11, 18043 (2021)
Computation and Selection of Optimal Biomarker Combinations by Integrative ROC Analysis Using CombiROC
Bombaci M., Rossi RL.
In: Brun V., Couté Y. (eds) Proteomics for Biomarker Discovery. Methods in Molecular Biology, vol 1959. Humana Press, New York, NY. (2019)
Big Data: Challenge and Opportunity for Translational and Industrial Research
Rossi RL, Grifantini RM.
Front. Digit. Humanit. 5:13 (2018)
myVCF: a desktop application for high-throughput mutations data management
Pietrelli A, Valenti L.
Bioinformatics btx475 (2017)
CombiROC: an interactive web tool for selecting accurate marker combinations of omics data
Mazzara S, Rossi RL, Grifantini R, Donizetti S, Abrignani S, Bombaci M.
Sci Rep (2017) 7:45477
Normalization of circulating microRNA expression data obtained by quantitative real-time RT-PCR
Marabita F, de Candia P, Torri A, Tegnér J, Abrignani S, Rossi RL.
Brief Bioinform (2016) 17:204-12
miRiadne: a web tool for consistent integration of miRNA nomenclature.
Bonnal RJ., Rossi RL., Carpi D., Ranzani V., Abrignani S., Pagani M.
Nucleic Acids Res (2015) 43:W487-92
