Guigó Lab
Computational Biology and Health Genomics
- Group page
- Research lines
- Group members
- Publications
- Datasets
- Software
1988 Ph.D. in Statistics. Universitat de Barcelona. (Spain).
1988-1993 Postdoctoral researcher at the Molecular Biology Computer Research Resource. Dana Farber Cancer Institute, Harvard University (Division of Biostatistics) BioMolecular Engineering Research Center. Boston University and Theoretical Biology and Biophysics Group (Los Alamos National Laboratory).
1994-2010 Investigator at Institut Municipal d’Investigació Mèdica (IMIM). Barcelona, (Spain).
2001-2005 Associate Professor at the Universitat Pompeu Fabra. Barcelona, (Spain).
2001-2022 Coordinator of the Bioinformatics Programme at the Centre de Regulació Genòmica, Barcelona, (Spain).
Since 2005 Professor at the Universitat Pompeu Fabra. Barcelona, (Spain).
See also the Roderic Guigó lab website
News
Earth BioGenome project begins efforts to sequence 40 thousand genomes in the Catalan Countries (18/07/2024)
Sequencing the genomes of all plants, animals, and fungi on Earth—nearly two million known species—can protect biodiversity as we know it. This is the aim of the Earth Biogenome Project (Earth BioGenome Project, EBP), which aims to characterise the genomic biodiversity of species in different regions of the planet.
CRG researchers promote inclusion of local languages in scientific publications (27/06/2024)
The researchers also translated the scientific abstract of the study, a short summary of the research which is usually written in the first pages of a scientific publication, in Catalan.
Expedition begins to chart the invisible biodiversity of Pyrenean lakes (16/04/2024)
Researchers from Spain, Andorra and France have begun efforts to chart the biodiversity of microorganisms living under the surface of 300 different lakes across the Pyrenees.
Epigenome mapped in detail across human body (30/03/2023)
An international research collaboration has addressed this challenge by cataloguing the human epigenome in unprecedented detail, charting the whole collection of epigenomic marks in more than 25 different types of tissues across the human body.
‘Space bacteria’ set to land in Barcelona (16/03/2023)
A new collaboration between the CRG and yuri GmbH, a commercial space biotechnology company in Germany, will bring together researchers to harness the power of microgravity conditions in space and study its impact on human health, cells, and plant biology.
Seasonal and circadian genetic variation charted across the human body (13/02/2023)
A research team at the CRG have published research in PLOS Biology which details how the circadian and circannual cycles influence humans at the molecular level by measuring changes to the activity of genes inside cells across different types of tissues.
CRG joins international efforts to protect human health from effects of harmful chemicals (11/03/2021)
Roderic Guigó will lead the CRG's contributions to the international consortium
CRG signs biodiversity genomic sequencing agreement with Andorra (01/03/2021)
This is the first time the country has been involved in a project of this magnitude
Small but ubiquitous sex differences in gene expression in human tissues linked to body fat, cancer, and birth weight (10/09/2020)
A study published today in Science reveals that biological sex has a small but ubiquitous influence on gene expression in almost every type of human tissue.
Human body mostly composed of just five major groups of cell (30/07/2020)
Scientists around the world have today published the results of the third phase of the Encyclopedia of DNA Elements (ENCODE) in Nature.
A study describes the dynamics of chromatin during organ and tissue regeneration (04/12/2018)
Researchers discover the genes and regulatory elements of their expression that are required during the process of tissue and organ regeneration in the fruit fly. The article appears highlighted in the cover of the Genome Research journal in December.
Roderic Guigó and Toni Gabaldón among the most highly cited researchers in the world (30/11/2018)
Two CRG group leaders are among the most highly cited researchers in the world, according to the list established by Clarivate Analytics.
Roderic Guigó receives the National Research Award (19/10/2018)
Roderic Guigó, coordinator of the CRG Bioinformatics and Genomics Programme, received the National Research Award. The award ceremony took place at the National Theatre of Catalonia in Barcelona on 15th October and it was chaired by the President of the Catalan Government, Mr. Quim Torra.
Gene expression patterns may help determine time of death (13/02/2018)
International team of scientists led by CRG programme coordinator Roderic Guigó shows that changes in gene expression in different tissues can be used to predict the time of death of individuals.
National Research Award 2017 to Roderic Guigó (27/12/2017)
Roderic Guigó, coordinator of the CRG Bioinformatics and Genomics Programme, honored with the highest recognition for research excellence in Catalonia.
A new method accelerates the mapping of genes in the “Dark Matter” of our DNA (06/11/2017)
Scientists at the Centre for Genomic Regulation (CRG) in Barcelona, have developed a new method, which improved the most important catalogue of genes -GENCODE-, including characterization of new genes in the DNA “Dark Matter”.
Genome Editing: Pressing the «Delete» Button on DNA (02/03/2017)
Until recently, genomics was a «read-only» science. But scientists led by Rory Johnson at the University of Bern and the Centre for Genomic Regulation in Barcelona, have now developed a tool for quick and easy deletion of DNA in living cells. This software will boost efforts to understand the vast regions of non-coding DNA, or «Dark Matter», in our DNA and may lead to discovery of new disease-causing genes and potential new drugs.
Spanish scientists sequence the genome of the Iberian lynx, the most endangered felid (14/12/2016)
Genomic analysis of the Iberian lynx confirms that it is one of the species with the least genetic diversity among individuals, which means that it has little margin for adaptation.
The Blueprint project celebrates major manuscript release (18/11/2016)
BLUEPRINT scientists, including researchers at the Centre for Genomic Regulation, release a collection of 25 publications in Cell, Cell Press-associated and other high-impact journals. These are part of a package of 41 publications by the International Human Epigenome Consortium (IHEC) of which BLUEPRINT is a member.
'Blueprint' study of epigenetics of blood cells will serve biology and medicine (07/09/2016)
Researchers from the EU-funded BLUEPRINT project join their international colleagues this week at the 2016 International Human Epigenome Consortium (IHEC) conference in Brussels to report the latest results in understanding blood cell development and blood disease.
CRG researchers contribute to the sequencing of the Turbot genome (09/03/2016)
The first vertebrate to be genetically sequenced in Spain, the Turbot (Scophthalmus maximus), has a much more refined visual system than other fish as it has evolved to adapt to the shortage of light of the seabed. In addition, the fat in its cell membranes are far higher than in other species to withstand the low water temperatures in its habitat.
The mesoamerican bean genome decoded (25/02/2016)
An Ibero-American team of scientists decoded the Mesoamerican variety of the bean genome coinciding with the celebration of the International Year of Pulses, as designated by the United Nations.
Funding acknowledgements
The "VEIS-Valorisation of EGA for Industry and Society" (VEIS-001-P-001647) project is cofunded by the European Regional Development Fund of the European Union in the framework of the ERDF Operative Program of Catalonia 2014-2020, with a grant of 1,951,429.38 EUR, with the support of the Department of Research and Universities of the Government of Catalonia.
The project “MAPAS GENETICOS Y EPIGENETICOS A LO LARGO DE LA FILOGENIA EUCARIOTA” (PID2021-128956NB-I00) is funded by Agencia Estatal de Investigación (AEI), the Ministerio de Ciencia e Innovación and Fondo Europeo de Desarrollo Regional (FEDER). Project PID2021-128956NB-I00 funded by MCIN/ AEI / 10.13039/501100011033 / FEDER, UE
The 'IMPaCT' project, with official reference IMP/00019, has been granted by the ISCIII, under the Infraestructura de Medicina de Precisión asociada a la Ciencia y Tecnología (IMPaCT) - Ciencia de Datos Call, the Spanish Ministry of Science and Innovation, and cofunded with ERDF Funds from the European Union.
Period: 01/01/2021 to 31/12/2023
Ref.: IMP/00019
The project 'Omplint el buit del mercat del single cell', granted under the call from 'Industria del Coneixement (Llavor i Producte), has received funding from the Department of Research and Universities, of the Government of Catalonia
Period: 19/10/2022 to 18/04/2024
The project “GENCODE: comprehensive reference genome annotation for human and mouse” has been granted by the NATIONAL INSTITUTE OF HEALTH – NIH with reference 2U24HG007234-09 .
Period: 14/09/2021 to 30/06/2025
Ref.: 2U24HG007234-09
The project 'Long-TREC: The Long-Reads Transcriptomics European Consortium. The next generation transcriptome biology revealed by single molecule sequencing technologies' has received funding from the European Union's HORIZON-MSCA-2021-DN-01-01 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 101072892.
Period: 01/10/2022 to 30/09/2026
Ref.: Marie Sklodowska-Curie grant agreement No 101072892
The overarching theme of the research in our group is the understanding of the information encoding in genomic sequences, and how this information is processed in the pathway leading from DNA to protein sequences. We are intrigued by the parallelism between computation and the processing of information encoded in biological sequences. The bulk of ouf research currently focuses on the mechanisms underlying RNA production (transcription) and post-processing (splicing), and it is increasingly incorporating high throughput and experimental approaches. We have recently became interested in the investigation of endophenotyes, specifically histopathological images, and we are developing computational methods to related histological features to cellular and molecular phenoytpes (mostly single-cell and bulk tissue transcriptomics) and to organismic phenotypes (i.e. diseases).
Research lines in our group are:
- Gene prediction, genome annotation, comparative genomics
- Identification and evolution of selenoproteins
- Large-scale functional genomics
- Epigenetic regulation of gene expression
- The impact of genetic variation in molecular and intermediate phenotypes
Group Leader
Staff Scientist
Postdoctoral Researchers
PhD Students
Technicians
Staff Scientist
Postdoctoral Researchers
PhD Students
Technicians
Datasets and resources
Pages
gff2ps is a program for visualizing annotations of genomic sequences
gff2ps is a program developed at Fundació Institut Mar d’Investigacions Mèdiques (IMIM) in collaboration with CRG (Roderic Guigó group), to visualize annotations of genomic sequences. The program takes as input the annotated features on a genomic sequence in GFF format, and produces a visual output in PostScript. It can be used in a very simple way, because it assumes that the GFF file itself carries enough formatting information, but it also allows through a number of options and/or a configuration file, for a great degree of customization.
For more information about this software, please click here
This site provides a series of programs for the functional investigation of groups of genes, based on the Gene Ontology resource.
This site provides a series of programs for the functional investigation of groups of genes, based on the Gene Ontology resource.
GRAPE 2.0 provides an extensive pipeline for RNA-Seq analyses.
GRAPE 2.0 provides an extensive pipeline for RNA-Seq analyses. It allows the creation of an automated and integrated workflow to manage, analyse and visualize RNA-Seq data.
The integrative pipeline for splicing analyses
The integrative pipeline for splicing analyses (IPSA); 1) Quantifies splice junctions and splice boundaries; 2) Calculates splicing indices, exon- and intron-centric; 3) analyzes micro-exons and local splice-graph structure
IRBIS is a computational pipeline for detecting conserved complementary regions in unaligned orthologous sequences
meta is a program to produce and to align the TF-maps of two gene promoter regions.
meta is a program is a program developed at Fundació Institut Mar d’Investigacions Mèdiques (IMIM) in collaboration with CRG (Roderic Guigó group) to produce and to align the TF-maps of two gene promoter regions. meta is very useful to characterize promoter regions from orthologous genes, or from co-regulated genes in microarrays, as it reduces the signal/noise ratio in a very significant manner, still detecting the real functional sites.
For more information about this software, please click here
mmeta is a program to produce and to align the TF-maps of multiple promoter regions.
mmeta is a program to produce and to align the TF-maps of multiple promoter regions. mmeta is very powerful to characterize promoter regions from multiple orthologous genes, or from co-regulated genes in microarrays, as it reduces the signal/noise ratio in a very significant manner, still detecting the real functional sites.
overlap is a program that computes the overlap between two sets of genomic features.
overlap is a program that computes the overlap between two sets of genomic features. More precisely it takes two gff files of genomic features as input and for each feature of the first set, says whether it is overlapped by a feature of the second set (basic mode, however more and more precise information can be retrieved).
PATRONUS is a program designed to compute in a very fast way the exact probability of observing a given number of occurrences of a simple motif in a sequence.
PATRONUS (from "PATtern Recognition by Optimized Numerical Universal Scoring") is a program designed to compute in a very fast way the exact probability of observing a given number of occurrences of a simple motif (that is, a continuous word without gaps) in a sequence. Its intended scope is the analysis of very long biological sequences, like chromosomes or whole genomes of complex organisms. The probability is computed on the basis of the Markovian statistics of order m for the sequence, that is the recorded number of the occurrences of all the submotifs of length m + 1 in the sequence. Contrary to what many people believe, computing such a probability for a generic motif is a computationally demanding task, mainly because motifs can overlap in non-trivial ways.
project is a program that projects genomic features onto their sequences.
project is a program that projects genomic features onto their sequences. Please contact Sarah Djebali (sarah dot djebali at crg dot es for any question).