Hadi Abdine

1 rue Honoré d'Estienne d'Orves. 91120 Palaiseau, France·
(+33) 6 29 55 41 94·
hadi[dot]abdine[at]polytechnique[dot]edu·

I am Hadi Abdine, I hold a Ph.D. in computer science, data and AI from Institut Polytechnique de Paris. My studies were done at LIX-École Polytechnique with the DaSciM team under the supervision of Prof. Michalis Vazirgiannis. My current research work focuses on natural language processing, pretrained language models and their application. Before joining LIX as a Ph.D. candidate, I graduated with a Master degree in data science from Institut Polytechnique de Paris, an engineering degree in data science from Telecom Paris and an engineering degree in computer science and telecommunication from the Lebanese university, Faculty of Engineering 1.

Dedicated to the field of Natural Language Processing (NLP) and the advancements facilitated by large language models, I am deeply passionate about the intersection of technology and linguistics. My research focuses on diverse NLP applications using transformer-based language models and LLMs. This envolves semantic, political, legal and bioinformatical (e.g. proteins function generation in free text using their 3D structures and amino acid sequences) applications.

News

Experience

École Polytechnique

Teaching

LIX - École Polytechnique

Natural Language Processing Researcher / Engineer

Distributed word representations are popularly used in many tasks in natural language processing to achieve high performance in many NLP tasks. In this project, we crawled a huge French corpus and used it to train static French word embeddings (Word2Vec). These word embeddings achived the highest performance in natural language understanding tasks among all the static French word Embeddings. This work is published in CNIA 2022 [PDF]. All the resources and code are published here.

May 2019 - November 2019

AZM center for Biomedical Research

Biomedical Engineering Intern

In this internship the main objective was designing and developing an ECG monitoring software using Raspberry Pi 3.

July 2018 - September 2018

CodenDot

Android Application Developement Intern

The main obgective of this internship was developing a drawing library, a Face detection tool, and the FaceVerter tool for the social app ”Docomix” using JAVA language.

July 2017 - September 2017

Education

École Polytechnique

Ph.D. in computer science
The era of transformer-based language models has led the way in a new paradigm in Natural Language Processing (NLP), enabling remarkable performance across a wide range of tasks from both fields Natural Language Understanding (NLU) and Natural Language Generation (NLG). This dissertation delves into the transformative potential of transformer-based language models when applied to specialized domains and languages. It comprises four distinct research endeavors, each contributing to the overarching goal of enhancing language understanding and generation in specialized contexts.
December 2020 - Present

Institut Polytechnique de Paris

Master M2 in Data Science
September 2019 - November 2020

Telecom Paris

Engineering Degree in Data Science

  • Eiffel Excellence Scholarship (08/2018 – 08/2020)

August 2018 - Octobre 2020

Lebanese University, Faculty of Engineering

Engineering Degree in Computer Science and Telecommunication
September 2014 - July 2018

Publications

  • Hadi Abdine, Michail Chatzianastasis, Costas Bouyioukos, Michalis Vazirgiannis. 2023. Prot2Text: Multimodal Protein’s Function Generation with GNNs and Transformers. Published in AAAI 224, Spotlight at DGM4H Neurips 2023 and AI4Science Neurips 2023.[PDF][Code & Dataset][Demo]
  • Hadi Abdine, Moussa Kamal Eddine, Davide Buscaldi, Michalis Vazirgiannis. 2023. Word sense induction with agglomerative clustering and mutual information maximization. In AI Open, Volume 4, Pages 193-201.[PDF][Code]
  • Iakovos Evdaimon, Hadi Abdine, Christos Xypolopoulos, Stamatis Outsios, Michalis Vazirgiannis, and Giorgos Stamou (2023). « GreekBART: The First Pretrained Greek Sequence-to-Sequence Model. », Published at LREC-COLING 2024. [PDF][Code][Dataset]
  • Hadi Abdine, Christos Xypolopoulos, Moussa Kamal Eddine, Michalis Vazirgiannis. 2022. Evaluation of Word Embeddings from Large-Scale French Web Content. In Conférence National en Intelligence Artificielle 2022, Saint-Etienne, France.[PDF][Code]
  • Hadi Abdine, Yanzhu Guo, Virgile Renard, Michalis Vazirgiannis. 2022. Political Communities on Twitter: Case Study for the 2022 French Presidential Election. In Proceedings of the Political Natural Language Processing Workshop 2022, Marseille, France.[PDF][Slides]
  • Stella Douka, Hadi Abdine, Michalis Vazirgiannis, Rajaa El Hamdani, and David Restrepo Amariles. 2021. JuriBERT: A Masked-Language Model Adaptation for French Legal Text. In Proceedings of the Natural Legal Language Processing Workshop 2021, pages 95–101, Punta Cana, Dominican Republic. Association for Computational Linguistics.[PDF]

Skills

Programming Languages & Tools
Workflow
  • Training and evaluation of language models and deep Learning models
  • Data crawling and pre-processing
  • Web Development