Profile Picture of Guy Yariv

Guy Yariv

I am a Research Scientist Intern at Meta (GenAI) and a PhD student in Computer Science at the School of Computer Science and Engineering, Hebrew University of Jerusalem, under the joint supervision of Yossi Adi and Sagie Benaim.

I spent the summer of 2024 as a Research Scientist Intern at Meta (GenAI) and worked as an AI Researcher at Spot by NetApp from winter 2022 to summer 2024.

My research interests include machine learning and generative AI. I’m passionate about achieving full controllability in media generation.


Publications

TTM.

Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation

CVPR 2025
Guy Yariv, Yuval Kirstain, Amit Zohar, Shelly Sheynin, Yaniv Taigman, Yossi Adi, Sagie Benaim, and Adam Polyak

We propose Through-The-Mask, a two-stage framework for Image-to-Video generation that uses mask-based motion trajectories to enhance object-specific motion accuracy and consistency, achieving state-of-the-art results, particularly in multi-object scenarios.

vLMIG's method.

Improving Visual Commonsense in Language Models via Multiple Image Generation

Arxiv Preprint
Guy Yariv, Idan Schwartz, Yossi Adi*, and Sagie Benaim*

We improve large language models' visual commonsense by generating multiple images from text prompts and integrating them into decision-making via late fusion, boosting performance on visual commonsense reasoning and NLP tasks.

TempoTokens.

Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

AAAI 2024
Guy Yariv, Itai Gat, Sagie Benaim, Lior Wolf, Idan Schwartz*, and Yossi Adi*

We propose a method to generate realistic, audio-aligned videos by adapting a text-to-video model with a lightweight adaptor.

AudioToken.

AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation

InterSpeech 2023
Guy Yariv, Itai Gat, Lior Wolf, Yossi Adi*, and Idan Schwartz*

We adapt text-conditioned diffusion models for audio-to-image generation by encoding audio into a token compatible with text representations.

Contact

Feel free to reach out:
guyyariv.mail at gmail dot com