Background: Several large public repositories of microarray datasets and RNA-seq data are available. Two prominent examples include ArrayExpress and NCBI GEO. Unfortunately, there is no easy way to import and manipulate data from such resources, because the data is stored in large files, requiring large bandwidth to download and special purpose data manipulation tools to extract subsets relevant for the specific analysis. Results: TACITuS is a web-based system that supports rapid query access to high-Throughput microarray and NGS repositories. The system is equipped with modules capable of managing large files, storing them in a cloud environment and extracting subsets of data in an easy and efficient way. The system also supports the ability to import data into Galaxy for further analysis. Conclusions: TACITuS automates most of the pre-processing needed to analyze high-Throughput microarray and NGS data from large publicly-Available repositories. The system implements several modules to manage large files in an easy and efficient way. Furthermore, it is capable deal with Galaxy environment allowing users to analyze data through a user-friendly interface.

TACITuS: Transcriptomic data collector, integrator, and selector on big data platform

Alaimo S.
Primo
;
Di Maria A.;Ferro A.;Pulvirenti A.
Ultimo
2019-01-01

Abstract

Background: Several large public repositories of microarray datasets and RNA-seq data are available. Two prominent examples include ArrayExpress and NCBI GEO. Unfortunately, there is no easy way to import and manipulate data from such resources, because the data is stored in large files, requiring large bandwidth to download and special purpose data manipulation tools to extract subsets relevant for the specific analysis. Results: TACITuS is a web-based system that supports rapid query access to high-Throughput microarray and NGS repositories. The system is equipped with modules capable of managing large files, storing them in a cloud environment and extracting subsets of data in an easy and efficient way. The system also supports the ability to import data into Galaxy for further analysis. Conclusions: TACITuS automates most of the pre-processing needed to analyze high-Throughput microarray and NGS data from large publicly-Available repositories. The system implements several modules to manage large files in an easy and efficient way. Furthermore, it is capable deal with Galaxy environment allowing users to analyze data through a user-friendly interface.
2019
Cloud storage and management; Galaxy; RNA-Seq
File in questo prodotto:
File Dimensione Formato  
tacitus19.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Dimensione 3.34 MB
Formato Adobe PDF
3.34 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11769/374055
Citazioni
  • ???jsp.display-item.citation.pmc??? 2
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 2
social impact