Hi! I'm Damien.
Welcome to my website.
I'm an engineering student pursuing my master's degree at ENSEEIHT in Toulouse, France. I study computer science, with a focus on computer vision, image processing, and machine learning.
I'm in a work-study training program, currently working at Polymny Studio as an R&D engineer.
Education
ENSEEIHT is a top-tier engineering school in Toulouse, France, that excels in computer science education and research. It provides students with state-of-the-art facilities and cutting-edge resources to deepen their understanding of computer science. From software engineering to machine learning, the computer science program at ENSEEIHT prepares graduates to succeed in a wide range of fields.
Preparatory courses, also known as "Classes Préparatoires aux Grandes Écoles" (CPGE), are intensive two-year programs that prepare students for the entrance exams to engineering schools. They offer a rigorous education in science and mathematics while sharpening critical thinking, problem-solving, and analytical skills, giving students the tools they need to succeed in their future studies.
Experience
During my work-study training at Polymny Studio, I contributed to several projects aimed at improving the quality of video and audio content.
My main focus was on implementing a deep learning-based solution for background removal using pre-trained networks, which involved adapting the networks to our specific use case.
I also worked on noise reduction and speech enhancement, and contributed to the development of the company's back-end in Rust and its front-end in the Elm programming language.
Additionally, I gained experience in video editing with FFmpeg, an open-source tool, to improve the visual quality of the studio's content. My work-study training at Polymny Studio gave me a solid foundation in video and audio processing, as well as software development and project management.
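The background-removal pipeline itself isn't public, but a minimal sketch of the idea, assuming an off-the-shelf pre-trained segmentation model (torchvision's DeepLabV3 as a stand-in for the networks we actually adapted), looks like this:

```python
# A minimal sketch of pre-trained background removal. DeepLabV3 is a
# stand-in here, not necessarily the network used at Polymny Studio.
import torch
from torchvision import transforms
from torchvision.models.segmentation import deeplabv3_resnet50
from PIL import Image

model = deeplabv3_resnet50(weights="DEFAULT").eval()

preprocess = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

def person_mask(image: Image.Image) -> torch.Tensor:
    """Return a boolean mask of pixels classified as 'person'."""
    x = preprocess(image).unsqueeze(0)          # (1, 3, H, W)
    with torch.no_grad():
        out = model(x)["out"]                   # (1, 21, H, W) logits
    return out.argmax(dim=1)[0] == 15           # Pascal VOC 'person' class
```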
As part of my internship with the REVA team at IRIT, I worked on the Ornithoscope project, which involved developing an embedded bird detection and tracking system using deep learning on Raspberry Pi.
I gained hands-on experience with various deep learning techniques, including YOLO and EfficientNet, to optimize the system's detection and tracking capabilities. I also conducted resource-consumption and performance analysis, optimizing the system's resource usage to maximize efficiency.
One of the main challenges we faced was a significant class imbalance in our dataset; I addressed it by benchmarking models, tuning hyperparameters, and experimenting with learning-rate schedulers.
I also developed a bounding box tracking algorithm to improve the accuracy of the system's tracking capabilities.
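The tracker's details aren't reproduced here, but a greedy IoU-matching sketch illustrates the general idea (the threshold and the matching strategy are assumptions, not the project's actual values):

```python
# A minimal sketch of IoU-based bounding box tracking; the actual
# Ornithoscope algorithm may differ.
def iou(a, b):
    """Intersection over union of two boxes (x1, y1, x2, y2)."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter + 1e-9)

def match_tracks(tracks, detections, threshold=0.3):
    """Greedily associate detections with existing tracks by IoU."""
    assignments, used = {}, set()
    for tid, box in tracks.items():
        best, best_iou = None, threshold
        for i, det in enumerate(detections):
            if i in used:
                continue
            score = iou(box, det)
            if score > best_iou:
                best, best_iou = i, score
        if best is not None:
            assignments[tid] = best
            used.add(best)
    return assignments  # unmatched detections can start new tracks
```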
My internship with the REVA team provided me with valuable experience in the application of deep learning techniques and a solid foundation in optimization and performance analysis.
During my internship with the Talent team, I was responsible for analyzing a Multiple Choice Question (MCQ) test that ENSEEIHT students had taken when entering the school.
This involved performing statistical analyses, including factor analysis, tetrachoric correlation, distractor analysis, discrimination and item analysis, and item response theory, to gain insight into the students' performance and the effectiveness of the questions. Through this process, I gained a deeper understanding of how to evaluate the quality of MCQs and identify areas for improvement.
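As one concrete example of item analysis, the point-biserial correlation measures how well success on a single item discriminates between strong and weak students; here is a minimal sketch, assuming a binary response matrix (the data layout is an assumption, not the actual study's format):

```python
# Point-biserial correlation between success on one item and the
# total score: a common measure of item discrimination.
import numpy as np

def point_biserial(responses: np.ndarray, item: int) -> float:
    """responses: (n_students, n_items) binary matrix (1 = correct)."""
    scores = responses.sum(axis=1)
    correct = responses[:, item] == 1
    m1 = scores[correct].mean()        # mean score of those who succeeded
    m0 = scores[~correct].mean()       # mean score of those who failed
    p = correct.mean()                 # item difficulty (success rate)
    return (m1 - m0) / scores.std() * np.sqrt(p * (1 - p))
```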
My internship with the Talent team provided me with valuable experience and skills that will be beneficial in my future career endeavors.
Projects
Here are some of the projects I worked on during my studies, internships, and free time.
Lunar Lander is a game in which the player controls a spaceship and attempts to land it safely on the surface of the Moon. The game was developed in Python using Pygame for the Gymnasium API.
As someone who had only studied model-based reinforcement learning (RL) in class, I wanted to experiment with deep reinforcement learning and push my understanding of the field.
The actor-critic method is a popular approach in RL that combines the benefits of both value-based and policy-based methods. It involves two separate neural networks, with one (the actor) responsible for selecting actions, and the other (the critic) estimating the value of those actions.
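A minimal sketch of the two networks, assuming PyTorch and the LunarLander dimensions (an 8-dimensional state and 4 discrete actions); the layer sizes are illustrative, not the ones I actually used:

```python
# Minimal actor-critic networks for a discrete-action environment.
import torch
import torch.nn as nn

class Actor(nn.Module):
    """Maps a state to a probability distribution over actions."""
    def __init__(self, state_dim=8, n_actions=4):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 128), nn.ReLU(),
            nn.Linear(128, n_actions), nn.Softmax(dim=-1),
        )
    def forward(self, state):
        return self.net(state)

class Critic(nn.Module):
    """Estimates the value of a state."""
    def __init__(self, state_dim=8):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 128), nn.ReLU(),
            nn.Linear(128, 1),
        )
    def forward(self, state):
        return self.net(state)
```

During training, the critic's temporal-difference error serves as the advantage signal that reinforces or discourages the actions the actor chose.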
I encountered a number of challenges along the way, including hyperparameter tuning and debugging issues, but with persistence I was able to successfully train and evaluate the agent. Through this project, I gained a deeper understanding of deep reinforcement learning.
The goal of this project was to classify images using a weakly supervised dataset. The dataset consisted of 18 classes, with only 20 labeled images per class for training, and 2000 labeled images in total (100 images per class for testing).
To tackle this challenge, we experimented with two different approaches: SimCLR and SGAN.
The first approach, SimCLR, is a self-supervised contrastive method for learning representations from a large number of unlabeled images. An encoder is trained so that differently augmented views of the same image map to nearby points in the representation space; a classifier is then trained on top of these representations using the few labeled images.
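The heart of SimCLR is its contrastive (NT-Xent) loss; here is a minimal sketch, assuming PyTorch, where `z1` and `z2` are the encoder's projections of two augmented views of the same batch:

```python
# NT-Xent: each view's positive is the other view of the same image;
# all other samples in the batch act as negatives.
import torch
import torch.nn.functional as F

def nt_xent(z1, z2, temperature=0.5):
    z = F.normalize(torch.cat([z1, z2]), dim=1)      # (2N, d)
    sim = z @ z.T / temperature                      # cosine similarities
    n = z1.shape[0]
    mask = torch.eye(2 * n, dtype=torch.bool)
    sim.masked_fill_(mask, float("-inf"))            # exclude self-pairs
    # row i's positive is the other augmented view of the same image
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(n)])
    return F.cross_entropy(sim, targets)
```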
The second approach, SGAN, is a semi-supervised GAN in which a generator produces synthetic images and a discriminator learns both to classify real images into the known classes and to tell real images apart from generated ones. The generated images push the discriminator to learn features that are informative for the classification task.
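The key architectural twist is the discriminator's output layer: K real classes plus one "generated" class. A hedged sketch, with an illustrative architecture rather than the one we used:

```python
# SGAN-style discriminator head: K real classes + 1 fake class, so
# labeled, unlabeled, and generated images all contribute gradients.
import torch.nn as nn

K = 18  # number of real classes in the dataset

discriminator = nn.Sequential(
    nn.Conv2d(3, 64, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(128, K + 1),  # K real classes + 1 "generated" class
)
```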
The goal of this project was to reconstruct a 3D model from a set of images, using a level set method.
The level set is a function whose zero crossing represents the surface of the object. It is initialized as a sphere and then evolved to fit the object.
During this project, we only had time to implement a 2D proof of concept. The results were promising, but the 3D implementation was too slow to be usable.
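In 2D the machinery fits in a few lines. This toy sketch, assuming NumPy, evolves a circle under a constant speed via the level set equation φ_t + F·|∇φ| = 0; a real reconstruction would derive the speed F from the images instead:

```python
# Toy 2D level set evolution: the contour is the zero crossing of phi,
# initialized as a circle and shrunk by a constant speed field.
import numpy as np

n = 128
y, x = np.mgrid[0:n, 0:n]
phi = np.sqrt((x - n / 2) ** 2 + (y - n / 2) ** 2) - 40  # circle SDF

F = -1.0   # constant inward speed; image-based methods compute F per pixel
dt = 0.5
for _ in range(50):
    gy, gx = np.gradient(phi)
    phi += dt * (-F) * np.sqrt(gx ** 2 + gy ** 2)  # phi_t = -F |grad phi|
    # the contour at this step is the set of pixels where phi changes sign
```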
This project provided a valuable opportunity to explore the field of 3D reconstruction and gain a deeper understanding of these concepts. A thorough state-of-the-art review was required to understand the level set method and how to apply it to an inverse problem.
The objective of this project was to develop a deep learning-based classification method for recognizing cars in the mobile game Rocket League Sideswipe, regardless of the accessories equipped.
To acquire the approximately 400,000 images in the dataset, a script was written to automate the customization of the car models and take screenshots on an Android phone. It took approximately 18 hours to compile the dataset.
To prevent overfitting, data augmentation was performed with the albumentations library (on the training data only).
A simple convolutional neural network (CNN) was created for the project. Given the size of the dataset, the network could be trained from scratch and did not require a pre-trained model.
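A hedged sketch of the setup, assuming albumentations and PyTorch; the transforms, layer sizes, and number of car classes are illustrative, not the project's actual values:

```python
# Training-only augmentation pipeline plus a small from-scratch CNN.
import albumentations as A
import torch.nn as nn

train_transform = A.Compose([
    A.HorizontalFlip(p=0.5),
    A.RandomBrightnessContrast(p=0.3),
    A.ShiftScaleRotate(shift_limit=0.05, scale_limit=0.1,
                       rotate_limit=10, p=0.5),
])

n_cars = 10  # hypothetical number of car models to recognize

model = nn.Sequential(
    nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(64, n_cars),
)
```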
This project involved creating a 3D rendering engine using C++ and OpenGL. The engine supports basic shapes, transformations, and lighting.
The renderer is capable of rendering a scene with multiple objects, each with its own transformations and materials. It shades each mesh face using the Phong reflection model.
Hidden surfaces are removed with z-buffering, and basic lighting is supported. There are many improvements that could still be made, such as adding transparency or implementing a more advanced lighting model.
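The per-face shading follows the classic Phong formula, combining ambient, diffuse, and specular terms. A small sketch in Python (the engine itself is C++, and the coefficients below are illustrative):

```python
# Phong reflection model for one face; all vectors are unit length,
# pointing from the surface toward the light / the eye.
import numpy as np

def phong(normal, light_dir, view_dir,
          k_a=0.1, k_d=0.7, k_s=0.4, shininess=32):
    """Scalar intensity: ambient + diffuse + specular."""
    n_dot_l = max(np.dot(normal, light_dir), 0.0)
    reflect = 2 * n_dot_l * normal - light_dir       # mirrored light ray
    r_dot_v = max(np.dot(reflect, view_dir), 0.0)
    return k_a + k_d * n_dot_l + k_s * r_dot_v ** shininess
```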
This project involved creating a camera travelling (a smooth tracking shot). The trajectory was computed using B-splines to ensure it was both smooth and controllable.
To achieve this, we utilized the Spigot API to develop a Minecraft plugin that provided a 3D environment with interactive controls.
This project required a strong understanding of geometric modeling, programming, and software development to successfully create an immersive experience for users. Overall, this project provided a valuable opportunity to explore the intersection of technology, 3D modeling, and game development.
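For reference, here is a minimal sketch of the B-spline trajectory computation, assuming SciPy (the plugin itself runs on the JVM; the control points, degree, and sampling rate are illustrative):

```python
# Sample camera positions along a clamped cubic B-spline.
import numpy as np
from scipy.interpolate import BSpline

degree = 3
control = np.array([[0, 64, 0], [10, 70, 5], [25, 66, 12],
                    [40, 72, 8], [55, 68, 0]], dtype=float)
# clamped knot vector so the curve starts and ends at the endpoints
n = len(control)
knots = np.concatenate([np.zeros(degree),
                        np.linspace(0, 1, n - degree + 1),
                        np.ones(degree)])
curve = BSpline(knots, control, degree)

positions = curve(np.linspace(0, 1, 200))  # one (x, y, z) per frame
```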
The goal of this project was to segment images using two methods. The first method was k-means, an unsupervised clustering algorithm that partitions the data into k clusters. The second method was mean shift, a non-parametric clustering algorithm that iteratively shifts each data point toward the nearest mode of the estimated density.
Image segmentation is a fundamental problem in computer vision that involves partitioning an image into multiple segments. This project provided a valuable opportunity to explore this field and gain a deeper understanding of image processing.
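As a minimal sketch of the k-means variant, assuming scikit-learn, each pixel's colour is treated as a data point and the label map is read back from the clustering:

```python
# K-means colour segmentation: cluster pixels in RGB space, then
# reshape the cluster labels back into an image-shaped label map.
import numpy as np
from sklearn.cluster import KMeans

def segment(image: np.ndarray, k: int = 4) -> np.ndarray:
    """image: (H, W, 3) array; returns an (H, W) label map."""
    h, w, c = image.shape
    pixels = image.reshape(-1, c).astype(float)
    labels = KMeans(n_clusters=k, n_init=10).fit_predict(pixels)
    return labels.reshape(h, w)
```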
This project involved implementing a compression algorithm for 3D meshes. The goal was to reduce the size of the mesh data sent over the network while maintaining a reasonable level of quality.
The algorithm also had to support progressive decompression, so that the user could see the mesh being reconstructed step by step in real time.
The algorithm we implemented was based on the MAPS paper, which describes a method for compressing 3D meshes using a combination of ordered vertex decimation, driven by vertex saliency, and retriangulation. Since we had to understand the paper in depth to implement the algorithm, we learned a lot about 3D mesh compression and the metrics used to evaluate it.
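As a toy illustration of the ordering idea (MAPS's actual priority and its retriangulation step are considerably more involved), one can rank vertices by a naive saliency such as the angle deficit and remove the flattest vertices first; a sketch assuming NumPy:

```python
# Toy vertex-decimation ordering: vertices in flat regions have an
# angle deficit near zero, so they are the cheapest to remove first.
import numpy as np

def angle_deficit(vertices, faces):
    """vertices: (V, 3); faces: (F, 3) index triples. Returns (V,)."""
    deficit = np.full(len(vertices), 2 * np.pi)
    for tri in faces:
        for i in range(3):
            a = vertices[tri[i]]
            b = vertices[tri[(i + 1) % 3]] - a
            c = vertices[tri[(i + 2) % 3]] - a
            cos = b @ c / (np.linalg.norm(b) * np.linalg.norm(c))
            deficit[tri[i]] -= np.arccos(np.clip(cos, -1, 1))
    return np.abs(deficit)  # ~0 on flat regions, larger on features

def decimation_order(vertices, faces):
    return np.argsort(angle_deficit(vertices, faces))  # flattest first
```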
This project involved implementing a compiler for a simple programming language. The compiler had to translate the source code into an intermediate representation, which could then be translated into machine code by an assembler.
The compiler was implemented in OCaml, a functional programming language. It translated source code written in RAT (a simple programming language) into an assembler-like language named TAM. The compiler verified the syntax of the source code, reported an error if the code was invalid, and then translated the program into TAM code.
This project was a great opportunity to learn about compilers and the various techniques used to translate source code into machine code. We learned about the different stages of compilation, such as lexical analysis, syntax analysis, semantic analysis, and code generation. We also learned about the various data structures used to represent the source code, such as abstract syntax trees and symbol tables.
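To give the flavour of the code-generation stage, here is a hedged sketch for a toy expression language, written in Python rather than OCaml; the emitted instructions mimic TAM's stack-machine style but are only illustrative:

```python
# Walk an abstract syntax tree and emit stack-machine code: operands
# are pushed, then the operation consumes the top of the stack.
from dataclasses import dataclass

@dataclass
class Num:
    value: int

@dataclass
class Add:
    left: object
    right: object

def gen(node) -> list[str]:
    if isinstance(node, Num):
        return [f"LOADL {node.value}"]    # push a literal
    if isinstance(node, Add):
        return gen(node.left) + gen(node.right) + ["SUBR IAdd"]
    raise TypeError(f"unknown node: {node}")

# gen(Add(Num(1), Add(Num(2), Num(3))))
# -> ['LOADL 1', 'LOADL 2', 'LOADL 3', 'SUBR IAdd', 'SUBR IAdd']
```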
This project involved implementing a system for animating a 3D avatar using a webcam. The goal was to allow the user to control the avatar's movements by moving their head.
The system had to track the user's head movements in real time, and the avatar's movements had to be smooth and natural. To achieve this, we used MediaPipe to obtain the user's head pose.
Then, we used the head pose to animate the avatar: a homography maps the avatar's head onto the user's head at each frame, and simple linear interpolation smooths the avatar's movements.
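A hedged sketch of the tracking step, assuming MediaPipe Face Mesh and OpenCV; the landmark indices and the four-point correspondence are illustrative choices, not necessarily the ones we used:

```python
# Detect face landmarks each frame and fit a homography that maps the
# avatar's head reference points onto the user's head.
import cv2
import mediapipe as mp
import numpy as np

face_mesh = mp.solutions.face_mesh.FaceMesh(max_num_faces=1)

def track_frame(frame_bgr, avatar_pts):
    """Return the homography mapping avatar_pts onto the user's head."""
    h, w = frame_bgr.shape[:2]
    result = face_mesh.process(cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2RGB))
    if not result.multi_face_landmarks:
        return None
    lm = result.multi_face_landmarks[0].landmark
    idx = [10, 152, 234, 454]   # forehead, chin, left/right side
    user_pts = np.float32([(lm[i].x * w, lm[i].y * h) for i in idx])
    H, _ = cv2.findHomography(avatar_pts, user_pts)
    return H
```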
This project was inspired by the popular The Million Dollar Homepage. Our website allowed users to purchase pixels on a virtual canvas, with each pixel representing a small piece of digital real estate. Users could buy pixels in any color they wanted, and we provided tools for them to create their own custom designs.
The web project we developed was a fun and exciting endeavor that combined various technologies to create a functional and engaging website. We used JBoss, a powerful and reliable Java application server, as the backend, which provided excellent support for our web application. On the frontend, we opted for a more hands-on approach, writing custom HTML, CSS, and JavaScript from scratch.
To manage our data, we implemented a MariaDB database server, which allowed us to store and retrieve data efficiently. The server was optimized for performance and could handle a high volume of queries without any issues. To handle the connections between the frontend and the backend, we used servlets, which provided an efficient and reliable bridge between the two.
This project involved implementing a client-server tuple space in Java, where clients could perform operations such as `read`, `write`, and `take`, as well as their `try` variants. It was also possible to register an event handler to trigger a callback when a specific tuple was added to the space.
The primary challenge of this project was to handle concurrency to ensure that multiple clients could access and modify the shared space simultaneously without conflicts or errors.
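The essence of that concurrency problem is a blocking `take` guarded by a condition variable; here is a minimal sketch in Python (the actual implementation was in Java, and the pattern-matching rule below is an assumption):

```python
# A tiny tuple space: write() appends and wakes waiters; take() blocks
# until a tuple matches the pattern (None acts as a wildcard).
import threading

class TupleSpace:
    def __init__(self):
        self._tuples = []
        self._cond = threading.Condition()

    def write(self, tup):
        with self._cond:
            self._tuples.append(tup)
            self._cond.notify_all()     # wake clients waiting on a match

    def take(self, pattern):
        def match(t):
            return len(t) == len(pattern) and all(
                p is None or p == v for p, v in zip(pattern, t))
        with self._cond:
            while True:
                for t in self._tuples:
                    if match(t):
                        self._tuples.remove(t)
                        return t
                self._cond.wait()       # released while sleeping
```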
Additionally, we had to demonstrate the package's functionality through unit tests and code coverage analysis with tools such as JaCoCo. Overall, this project required strong programming skills, an understanding of distributed data systems, and the ability to develop effective testing strategies to ensure the package's reliability.