Bio

I am an AI Scientist specializing in computer vision, multimodal models, and generative AI. I currently work as an Applied Scientist at Amazon Berlin, where I develop large-scale AI systems for customer-facing applications including virtual try-on, image translation and Prime Video sports highlights.

I received my PhD in Electrical Engineering from the Doctoral School of Electrical Engineering at École Polytechnique Fédérale de Lausanne (EPFL), Switzerland, under the supervision of Dr. Jean-Marc Odobez in the Perception Group at Idiap Research Institute. During my PhD, I interned at Google and Amazon working on deep learning and computer vision.

I obtained an MSc in Computer Science from the Center for Research in Mathematics (CIMAT), during which I completed a research internship at the LAAS-CNRS, France. Before that, I worked as a software engineer at Intel.

Publications

I'm interested in computer vision, machine learning, and applications for human-computer interaction.

Pose Transformers preview Pose Transformers animation

Pose Transformers: Human Motion Prediction with Non-Autoregressive Transformers

Angel Martínez-González, Michael Villamizar, Jean-Marc Odobez

IEEE/CVF International Conference on Computer Vision, 2021

paper · project page · poster

A new non-autoregressive Transformer architecture to predict 3D human motion and activities.

Hourglass pushing preview Hourglass pushing animation

An Efficient Image-to-Image Translation HourGlass-based Architecture for Object Pushing Policy Learning

Marco Ewerton, Angel Martínez-González, Jean-Marc Odobez

IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

paper · project page

An object pushing policy based using an Hourglass-based CNN architecture.

Residual Pose preview Residual Pose animation

Residual Pose: A Decoupled Approach for Depth-Based 3D Human Pose Estimation

Angel Martínez-González, Michael Villamizar, Olivier Canévet, Jean-Marc Odobez

IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

paper · project page · video

A new 3D pose estimation method that relies in residual pose modeling suitable for multi-person scenarios.

WatchNet++ preview

WatchNet++: Efficient and Accurate Depth-Based Network for Detecting People Attacks and Intrusion

Michael Villamizar, Angel Martínez-González, Olivier Canévet, Jean-Marc Odobez

Machine Vision and Applications (MVAP), 2020

paper · video · BibTex

Detecting people from top-view images for attack detection in security airlocks.

Efficient CNN preview Efficient CNN animation

Efficient Convolutional Neural Networks for Depth-Based Multi-Person Pose Estimation

Angel Martínez-González, Michael Villamizar, Olivier Canévet, Jean-Marc Odobez

IEEE Transactions on Circuits and Systems for Video Technology, 2020 (accepted Nov. 2019)

paper · data · project page

Leveraging lightweight CNN, domain adaptation and knowledge distillation for 2D pose estimation.

Domain adaptation preview Domain adaptation animation

Investigating Depth Domain Adaptation for Efficient Human Pose Estimation

Angel Martínez-González, Michael Villamizar, Olivier Canévet, Jean-Marc Odobez

ECCV Workshop on Human Behavior Understanding, 2018

paper · data

Investigates domain adaptation to close the covariance shift gap from learning with synthetic data.

WatchNet preview

WatchNet: Efficient and Depth-based Network for People Detection in Video Surveillance Systems

Michael Villamizar, Angel Martínez-González, Olivier Canévet, Jean-Marc Odobez

IEEE International Conference on Advanced Video and Signal-Based Processing (AVSS), 2018

paper

People detection from top-to-down facing images for security systems.

Real-time convolutional networks preview Real-time convolutional networks animation

Real-time Convolutional Networks for Depth-based Human Pose Estimation

Angel Martínez-González, Michael Villamizar, Olivier Canévet, Jean-Marc Odobez

IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

paper

Introducing a new CNN architecture for real time pose estimation from synthetic depth images.

Face detection preview

Real Time Face Detection Using Neural Networks

Angel Martínez-González, Victor Ayala Ramirez

IEEE Mexican International Conference on Artificial Intelligence, 2011

paper

Skin color-based searching and face detection with neural networks.

Projects

DepthHuman preview

DepthHuman: A tool for depth image synthesis for human pose estimation

We have created the DIH dataset, a large scale dataset of synthetic depth images with annotations for depth-based 2D pose estimation with this tool.

code · dataset

ViZDoom preview

ViZDoom and Reinforcement Learning

Solving navigation tasks in a 3D FPS game environment for autonomous agents with deep reinforcement learning methods.

project page

Monte Carlo localization preview

Montecarlo Localization

When an autonomous drone needs to localize itself in a map, motion models and particle filters come to place to save the day.

code

ML & AI Notes

Personal notes and summaries on topics I am interested in.

  • Notes on Diffusion Models — A summary of the forward process, reverse process, training objective, and sampling intuition.