Yann Ponty

CNRS Research Director · Head of AMIBio team
Faculty member of LIX (Comp. Sci. Dept), École Polytechnique

Office 2005 · LIX/Bat. Turing · 1 rue Estienne d'Orves · 91120 Palaiseau · France
+33 1 77 57 80 95

I am a tenured CNRS research scientist, based at the Computer Science Department (LIX) of École Polytechnique (Institut Polytechnique de Paris, France). I design bioinformatics methods, usually relying heavily on discrete algorithmic techniques, to expand our common knowledge of ribonucleic acids (RNA) and exploit their potential in biotechnology.

My research interests include:

RNA folding, design, and evolution
RNA/RNA, RNA/Proteins, and Proteins/Proteins interactions
Random generation and enumerative combinatorics
Discrete Algorithms, Parameterized Complexity (Dynamic programming!)
RNA visualization

News

Nominated as HDR referent for the IDIA dept (CS&interactions) of IP Paris

Acting as deputy director of LIX (CS Dept@X, IP Paris)

Deeply honored to be elected as a member of the ISCB Board of Directors (2025-2027)

Research

Over the past decade, my research interests have been at the intersection of Computer Science, Mathematics and Molecular Biology. On the applied level, I mainly design analytic approaches, efficient algorithms and tools to answer key questions in Bioinformatics, with a special focus on RNA biology. These questions include, but are not limited to:

How to predict RNA structure in the presence of pseudoknots?
What is the prevalence of kinetics within the RNA folding process?
What is the interplay between RNA structure and evolution?
How can knowledge of RNA structure help in the analysis of experiments?
Conversely, how to use coarse-grained experimental data to perform an accurate RNA structure prediction?
How to design RNA sequences that perform predefined functions in vivo?

Some of these questions are related to universal properties of biopolymers. In such cases, they do not necessarily require considering a specific sequence, or overly sophisticated (and complex) energy models. Provided they can be accurately rephrased at such an abstract level, I typically strive to provide (asymptotic) analytic results using standard tools and techniques (generating functions, singularity analysis...) borrowed from the fields of enumerative combinatorics and analytic combinatorics.
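As a textbook illustration of this kind of result (not taken from a specific publication listed here), consider counting sequence-independent secondary structures of length n. The model below is an assumption chosen for simplicity: every base pair is allowed, and each hairpin must enclose at least one unpaired position. A structure is then either empty, or starts with an unpaired position, or starts with a base pair enclosing a non-empty structure.

```latex
\begin{align*}
% Symbolic decomposition translated into a generating function equation
S(z) &= 1 + z\,S(z) + z^{2}\,\bigl(S(z) - 1\bigr)\,S(z) \\
% Solving the quadratic, keeping the branch that is analytic at z = 0
S(z) &= \frac{1 - z + z^{2} - \sqrt{(1 - 3z + z^{2})(1 + z + z^{2})}}{2z^{2}} \\
% Dominant singularity: smallest root of 1 - 3z + z^2
\rho &= \frac{3 - \sqrt{5}}{2} \\
% Square-root singularity at rho gives the asymptotic number of structures
[z^{n}]\,S(z) &\sim \sqrt{\frac{15 + 7\sqrt{5}}{8\pi}}\; n^{-3/2} \left(\frac{3 + \sqrt{5}}{2}\right)^{n}
\end{align*}
```

The first coefficients are 1, 1, 1, 2, 4, 8, 17, 37, 82, and the exponential growth rate (3 + √5)/2 ≈ 2.618 follows directly from the dominant singularity.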

More complicated questions may still lend themselves quite nicely to exact resolution through polynomial time/space algorithms, usually based on dynamic programming. The concepts and design principles underlying such tools can sometimes be generalized to some other application contexts in bioinformatics, such as comparative genomics.
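To make the dynamic programming idea concrete, here is a minimal sketch of the classic base-pair maximization scheme (Nussinov-style). It is an illustration only, not the algorithm of any tool listed on this page: the pairing rules, the `min_loop` parameter and the function name `max_base_pairs` are assumptions made for the example, and real predictors rely on far richer energy models.

```python
# Allowed pairs: Watson-Crick plus G-U wobble (a simplifying assumption).
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def max_base_pairs(seq, min_loop=3):
    """Maximum number of non-crossing base pairs in seq.

    min_loop is the minimum number of unpaired positions required
    between the two partners of a base pair.
    """
    n = len(seq)
    if n == 0:
        return 0
    # best[i][j] = optimum for the subsequence seq[i..j] (inclusive).
    # Entries for regions too short to hold a pair stay at 0.
    best = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            # Case 1: position i is left unpaired.
            score = best[i + 1][j]
            # Case 2: position i pairs with some position k.
            for k in range(i + min_loop + 1, j + 1):
                if (seq[i], seq[k]) in PAIRS:
                    inside = best[i + 1][k - 1]
                    outside = best[k + 1][j] if k < j else 0
                    score = max(score, 1 + inside + outside)
            best[i][j] = score
    return best[0][n - 1]
```

For instance, `max_base_pairs("GGGAAAUCC")` fills a 9 × 9 table in cubic time and quadratic space, which is the typical polynomial cost of such schemes.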

Sometimes, the problem turns out to be computationally intractable, or provably hard in the well-defined meaning given to the term by the field of computational complexity theory. In these situations, I try to establish what makes the problem hard, and how to possibly work around the hardness result either by adopting a classic parameterized complexity approach, or by simplifying the model in order to achieve an acceptable tradeoff between expressivity and tractability.

In the (increasingly common) cases where the problem is hard to analyze, or, more and more often, as a first approach to test exploratory hypotheses, I tend to adopt a probabilistic perspective based on random sampling within adequately controlled distributions, such as the uniform distribution or the Boltzmann distribution.
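The sketch below illustrates uniform random generation with the so-called recursive method: count the objects first, then draw each decision with probability proportional to the number of completions. It is a toy model under stated assumptions (sequence-independent structures, at least `MIN_LOOP` unpaired positions per hairpin; the names `count` and `sample` are hypothetical), not the implementation of any tool listed here. Replacing the counts with Boltzmann-weighted partition functions would, in principle, give Boltzmann sampling instead.

```python
import random
from functools import lru_cache

MIN_LOOP = 1  # assumed minimum number of unpaired positions inside a hairpin


@lru_cache(maxsize=None)
def count(n):
    """Number of secondary structures over n positions."""
    if n <= 0:
        return 1
    total = count(n - 1)  # first position unpaired
    # First position paired with position k (1-based): k - 2 positions
    # are enclosed by the pair, n - k positions follow it.
    for k in range(MIN_LOOP + 2, n + 1):
        total += count(k - 2) * count(n - k)
    return total


def sample(n, rng):
    """Draw a uniformly random structure, returned in dot-bracket notation."""
    if n <= 0:
        return ""
    r = rng.randrange(count(n))
    r -= count(n - 1)
    if r < 0:
        return "." + sample(n - 1, rng)
    for k in range(MIN_LOOP + 2, n + 1):
        r -= count(k - 2) * count(n - k)
        if r < 0:
            return "(" + sample(k - 2, rng) + ")" + sample(n - k, rng)
    raise AssertionError("unreachable: the counts cover every case")
```

A call such as `sample(30, random.Random())` returns one structure of length 30, each of the `count(30)` candidates being equally likely.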

Publications


This list is loaded from HAL, and formatted by bibtex-js.
In case something goes wrong: Access static version

Software

RNA Bioinformatics

VARNA

Drawing and editing of RNA secondary structures. Accepts a wide range of documented and illustrated options, and offers rich user interaction.

Collab.: A. Denise@Paris-Saclay Univ.


RNANR

Non-redundant sampling of RNA secondary structures. Allows for the generation of key landmarks (locally optimal structures) of kinetic landscapes.

Collab.: H. Touzet@Univ. Lille


SPARCS

SPARCS is a program to analyze structured and unstructured regions in coding RNA sequences. Uses a random model preserving both the amino acid sequence and the dinucleotide content, enabling a computation of accurate z-score values.

Collab.: J. Waldispühl@Univ. McGill


RNA Design

IncaRNAtion

Design of RNA sequences folding into a target secondary structure with a prescribed nucleotide distribution. Implements a global sampling approach, which provides more relevant seed sequences.

Collab.: J. Waldispühl@Univ. McGill


RNARedPrint

Design of RNAs with multiple conformations. Uses an FPT dynamic programming scheme to sample sequences under expressive constraints to solve the positive design of RNA.

Collab.: S. Will@TBI Vienna


IncaRNAfbInv

Fragment-based design of RNA sequences, combining a (constrained) global sampling strategy for the generation of seed sequences with a flexible local search optimization.

Collab.: J. Waldispühl@Univ. McGill · D. Barash@Ben Gurion Univ.


Sequence analysis

GenRGenS

A toolkit for the random generation of sequences. Supports different classes of models, including weighted context-free grammars, Markov models, ProSITE patterns...

Collab.: A. Denise@Paris-Saclay Univ.


RNAPyro

Error-correction of RNA sequencing data using secondary structure information.

Collab.: J. Waldispühl@Univ. McGill


RNA Structural Bioinformatics

DIAL

A web server for the 3D alignment and motif search of experimentally resolved RNAs. Captures similarities in sequence content, secondary structure (using RNAView), and 3D rotameric properties (dihedral angles).

Collab.: P. Clote@Boston College


LocalMove

LocalMove solves the discretization (aka best on-lattice fit) for 3D macromolecules. Fits the polymer backbone to a discrete regular set of points (lattice), using a local search (MCMC) based on local moves.

Collab.: P. Clote@Boston College


Personal trajectory

LIX · Ecole Polytechnique · France

CNRS Research Director · Tenured

RNA Bioinformatics · Random Generation · Applied Analytic Combinatorics

October 2020 - Current date

Département d'Informatique · Université Paris-Saclay · France

Habilitation in Computer Science

RNA Bioinformatics · Dynamic Programming · Random Generation · Analytic Combinatorics

May 2020

LIX · Ecole Polytechnique · France

Head of AMIBio Team

RNA Bioinformatics · Random Generation · Applied Analytic Combinatorics

January 2016 - Current date

UMI PIMS/Maths Dept · Simon Fraser University · Canada

CNRS Researcher · Visiting scientist

Comparative Genomics · RNA Bioinformatics · Random Generation

Main local collaborations with Cédric Chauve and Marni Mishna

September 2013 - September 2015

LIX · Ecole Polytechnique · France

CNRS Researcher · Tenured

RNA Design · Random Generation · RNA Bioinformatics

Member of AMIBio team

November 2009 - September 2020

IRIF · Université Paris Diderot · France

ANR Postdoctoral Fellow

Random Generation · Enumerative Combinatorics · Analytic Combinatorics

Supervised by Dominique Rossin and Michèle Soria (LIP6)

November 2008 - October 2009

LIP6 · Sorbonne Université · France

Decrypthon Postdoctoral Fellow

Structural Bioinformatics · Protein-Protein interactions

Supervised by Alessandra Carbone (LIP6)

April 2008 - November 2008

Biology Department · Boston College · USA

NSF Postdoctoral Fellow

RNA Bioinformatics · Structural Bioinformatics

Supervised by Peter Clote

October 2006 - April 2008

LRI · Université Paris-Saclay · France

PhD in Computer Science

Bioinformatics · Enumerative Combinatorics · Analysis and Design of Algorithms

Bioinfo team at LRI · Supervised by Alain Denise · Reviewed by Philippe Flajolet and Eric Rivals

October 2003 - October 2006

School of Computer Science · Université Paris-Saclay · France

Master of Science · DEA Algorithmique

Theoretical Computer Science · Enumerative Combinatorics · Analysis and Design of Algorithms

October 2000 - July 2003

Université Paris-Saclay · France

Bachelor of Science

Computer Science · Mathematics · Physics

October 1997 - July 2000

Main service

Some of my activities besides scientific research and teaching:

Head of the AMIBio team@LIX · Since 2016
Coordinator (with F. Cazals) of the Structural Bioinformatics axis (MASIM) of the Research network in molecular bioinformatics (GdR BIM) · Since 2014
Elected member (2016-2020) of the conseil de laboratoire@LIX
Scientific advisory board of RNACentral and RFAM databases at EMBL-EBI · Since 2019
Committee of the Gilles Kahn/SIF award for best French PhD in Computer Science · Since 2018
Scientific council of DIM RFSI · Since 2019

I have also kept busy in the past by being:

Elected member (2012-2016) of the comité national du CNRS in Computer Science (Section 6) and Multidisciplinary approaches for the analysis of biological data and systems (CID 51)

Editorial and reviewing activities

Associate editor of Bioinformatics, published by Oxford University Press · Since 2019
Chair/area chair for the program committee of
- CMSR'14
- ISMB/ECCB'21
Program committee member for
- RECOMB-CG'21
- SeqBIM'20
- ISMB'20
- RECOMB-CG'20
- APBC'20
- ISMB/ECCB'19
- ACM-BCB'19
- BICOB'19
- RECOMB'19
- APBC'19
- SeqBio'18
- GIW'18
- ISMB'18
- RECOMB'18
- BICOB'18
- ISMB/ECCB'17
- RECOMB'17
- BICOB'17
- SeqBio'16
- ECCB'16
- BioVis'16
- ISMB'16
- BICOB'16
- SeqBio'15
- WABI'15
- BioVis'15
- ISMB/ECCB'15
- BICOB'15
- ECCB'14
- BioVis'14
- ISMB'14
- BICOB'14
- ISMB/ECCB'13
- JOBIM'13
- BICOB'13
- BICOB'12
- JOBIM'12
- WRSBS'12
- JOBIM'11
Regular reviewer for
Organizer of scientific events:

Teaching

While my main activities are in scientific research, I regularly teach at the graduate and, less frequently, undergraduate levels.

Université Paris-Saclay · Palaiseau, France

AMI2B Master Program · Second year (M2)

Combinatorial Optimization · RNA Bioinformatics

Since 2009

Infos/resources for the ongoing academic year:

Lecture 1 - Alignments [pdf]
Lecture 3 - Graphs and assembly [pdf]
Lab assignment - Eulerian paths and k-mers assembly [pdf]
RNA structure prediction [pdf]
Lab handout - RNA parsing, counting and folding
List of articles for student presentations
1. Alkan C, Karakoç E, Nadeau JH, Sahinalp SC and Zhang K (2006), "RNA–RNA Interaction Prediction and Antisense RNA Target Search", Journal of Computational Biology, Mar 2006. Vol. 13(2), pp. 267-282. Mary Ann Liebert Inc.
2. Chikhi R, Limasset A, Jackman S, Simpson JT and Medvedev P (2015), "On the Representation of de Bruijn Graphs", Journal of Computational Biology, May 2015. Vol. 22(5), pp. 336-352. Mary Ann Liebert Inc.
3. Dondi R, Lafond M and Scornavacca C (2019), "Reconciling multiple genes trees via segmental duplications and losses", Algorithms for Molecular Biology, Mar 2019. Vol. 14(1). Springer Science and Business Media LLC.
4. Ferragina P and Manzini G (2005), "Indexing compressed text", Journal of the ACM, Jul 2005. Vol. 52(4), pp. 552-581. Association for Computing Machinery.
5. Hammer S, Wang W, Will S and Ponty Y (2019), "Fixed-parameter tractable sampling for RNA design with multiple target structures", BMC Bioinformatics, Apr 2019. Vol. 20, pp. 209.
6. Hoffmann S, Otto C, Kurtz S, Sharma CM, Khaitovich P, Vogel J, Stadler PF and Hackermüller J (2009), "Fast mapping of short sequences with mismatches, insertions and deletions using index structures", PLoS Computational Biology, Sep 2009. Vol. 5(9), pp. e1000502.
7. Limasset A, Cazaux B, Rivals E and Peterlongo P (2016), "Read mapping on de Bruijn graphs", BMC Bioinformatics, Jun 2016. Vol. 17(1). Springer Science and Business Media LLC.
8. Medvedev P, Pham S, Chaisson M, Tesler G and Pevzner P (2011), "Paired de Bruijn Graphs: A Novel Approach for Incorporating Mate Pair Information into Genome Assemblers", Journal of Computational Biology, Nov 2011. Vol. 18(11), pp. 1625-1634.
9. Miklos I, Meyer I and Nagy B (2005), "Moments of the Boltzmann distribution for RNA secondary structures", Bulletin of Mathematical Biology, Sep 2005. Vol. 67(5), pp. 1031-1047. Springer Science and Business Media LLC.
10. Myers G (2013), "What's Behind Blast", In Models and Algorithms for Genome Evolution, pp. 3-15. Springer London.
11. Reidys CM, Huang FWD, Andersen JE, Penner RC, Stadler PF and Nebel ME (2011), "Topology and prediction of RNA pseudoknots", Bioinformatics, Feb 2011. Vol. 27(8), pp. 1076-1085. Oxford University Press (OUP).
12. Strothmann D (2007), "The affix array data structure and its applications to RNA secondary structure analysis", Theoretical Computer Science, Dec 2007. Vol. 389(1-2), pp. 278-294. Elsevier BV.
13. Xu J and Berger B (2006), "Fast and Accurate Algorithms for Protein Side-Chain Packing", Journal of the ACM, Jul 2006. Vol. 53(4), pp. 533-557. Association for Computing Machinery.

Sorbonne Universités · France

BIM Master Program · Second year (M2)

RNA Bioinformatics · Structural Bioinformatics

Since 2009

Infos/resources for the ongoing academic year:

Ecole Polytechnique · France

Engineering Program · BSc/MSc

Algorithms and Programming

2009-2015

Fun* stuff and ramblings

*Well, fun for me, but probably an acquired taste (at best) for you

Neil Gaiman's best tips for survival in an artistic academic career

Even though there are some differences between art and academia, it is quite striking how some of Neil Gaiman's advice to young artists mirrors what I would tell my students (with at least one notable exception... :) ).

The secret to keeping fruitful collaborations

You get work however you get work, but people keep working [..] because their work is good, and because they're easy to get along with, and because they deliver the work on time. And you don't even need all three... two out of three is fine!

People will tolerate how unpleasant you are if the work is good and you deliver it on time.
People will forgive the lateness of your work if it's good and they like you.
And you don't have to be as good as everyone else if you're on time and it's always a pleasure to hear from you.