Everybody Dance Now

AI-generated keywords: Motion transfer Video-to-video translation Pose extraction Face synthesis Synthetic content detection

AI-generated Key Points

⚠The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

The paper presents a novel method for "do as I do" motion transfer in videos
Utilizes pose as an intermediate representation for video-to-video translation
Predicts two consecutive frames for temporally coherent video output
Developed a separate pipeline for realistic face synthesis within transferred motion sequences
Addressed synthetic content detection with a forensics tool to distinguish between real and synthesized videos
Released an open-source dataset of videos for training and experiments

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Caroline Chan, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros

arXiv: 1808.07371v2 - DOI (cs.GR)

In ICCV 2019

License: NONEXCLUSIVE-DISTRIB 1.0

Abstract: This paper presents a simple method for "do as I do" motion transfer: given a source video of a person dancing, we can transfer that performance to a novel (amateur) target after only a few minutes of the target subject performing standard moves. We approach this problem as video-to-video translation using pose as an intermediate representation. To transfer the motion, we extract poses from the source subject and apply the learned pose-to-appearance mapping to generate the target subject. We predict two consecutive frames for temporally coherent video results and introduce a separate pipeline for realistic face synthesis. Although our method is quite simple, it produces surprisingly compelling results (see video). This motivates us to also provide a forensics tool for reliable synthetic content detection, which is able to distinguish videos synthesized by our system from real data. In addition, we release a first-of-its-kind open-source dataset of videos that can be legally used for training and motion transfer.

Submitted to arXiv on 22 Aug. 2018

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

⚠The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1808.07371v2

⚠This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

Comprehensive Summary
Key points
Layman's Summary
Blog article

The paper titled "Everybody Dance Now" presents a novel method for "do as I do" motion transfer in videos. The researchers introduce a simple yet effective approach to transferring dance performances from a source subject to a novice target individual. This process involves utilizing pose as an intermediate representation for video-to-video translation. By extracting poses from the source video and applying a learned pose-to-appearance mapping, the researchers are able to generate realistic motion transfer results on the target subject within minutes of them performing standard moves. One key aspect of their method is predicting two consecutive frames to ensure temporally coherent video output. Additionally, they have developed a separate pipeline for realistic face synthesis within the transferred motion sequences. The proposed technique yields remarkably compelling results as demonstrated in accompanying videos. Moreover, the researchers address the issue of synthetic content detection by providing a forensics tool capable of distinguishing between videos synthesized by their system and real data. This tool enhances the credibility and reliability of synthetic content generated through their method. To further promote research and development in this area, the authors have released an open-source dataset of videos that can be legally utilized for training purposes and motion transfer experiments. Overall, "Everybody Dance Now" offers a valuable contribution to the field of computer vision and video processing with its innovative approach to motion transfer in dance videos.

- The paper presents a novel method for "do as I do" motion transfer in videos
- Utilizes pose as an intermediate representation for video-to-video translation
- Predicts two consecutive frames for temporally coherent video output
- Developed a separate pipeline for realistic face synthesis within transferred motion sequences
- Addressed synthetic content detection with a forensics tool to distinguish between real and synthesized videos
- Released an open-source dataset of videos for training and experiments

Summary- The paper talks about a new way to copy movements from one video to another. - It uses body positions as a middle step to change videos. - It guesses what the next two frames in a video will look like for smooth movement. - They made a special process to make faces look real in the copied videos. - They made a tool to tell if a video is real or fake. Definitions- Novel: Something new and different. - Pose: The position of someone's body, like standing or sitting. - Temporally: Related to time or happening over time. - Coherent: Making sense and being logical. - Synthesis: Combining things together to create something new.

The Revolutionary "Everybody Dance Now" Paper: A Breakthrough in Motion Transfer for Videos

The ability to transfer motion from one individual to another has been a long-standing challenge in the field of computer vision and video processing. However, a recent research paper titled "Everybody Dance Now" has presented a novel method that allows for seamless motion transfer in dance videos. This groundbreaking approach utilizes pose as an intermediate representation for video-to-video translation, resulting in realistic and compelling results.

Introduction

The paper begins by addressing the common issue of transferring motion between individuals with varying body shapes and sizes. Traditional methods rely on 3D models or manual keyframe animation, which can be time-consuming and often produce unnatural-looking results. The researchers propose a new technique that simplifies this process by using poses extracted from the source video as an intermediate representation.

Pose-Based Motion Transfer

The core concept behind "Everybody Dance Now" is utilizing pose information to translate movements from one subject to another. The researchers first extract poses from the source video using state-of-the-art pose estimation algorithms. These poses are then used to train a pose-to-appearance mapping model, which learns how different poses correspond to specific movements. Once trained, this model can generate realistic motion transfer results on any target individual within minutes of them performing standard moves. To ensure temporal coherence in the output video, the researchers also predict two consecutive frames based on the extracted poses.

Realistic Face Synthesis

In addition to body movements, facial expressions play a crucial role in dance performances. Therefore, the researchers have developed a separate pipeline for realistic face synthesis within transferred motion sequences. This involves training a generative adversarial network (GAN) on real face images and then applying it to synthesize faces onto target subjects during motion transfer. This additional step enhances the overall realism of their results and makes the transferred videos even more convincing.

Forensics Tool for Synthetic Content Detection

One potential concern with this type of technology is the possibility of creating fake or synthetic content. To address this issue, the researchers have developed a forensics tool that can distinguish between videos synthesized by their system and real data. This tool uses deep learning techniques to analyze various visual cues in the video, such as lighting, shadows, and motion patterns. It has shown high accuracy in detecting synthetic content generated through their method, thus enhancing the credibility and reliability of their results.

Open-Source Dataset for Training Purposes

To further promote research and development in this area, the authors have released an open-source dataset of videos that can be legally utilized for training purposes and motion transfer experiments. This dataset includes a variety of dance performances from different individuals, allowing for diverse training data to improve the generalization ability of their model.

Conclusion

"Everybody Dance Now" presents a revolutionary approach to motion transfer in dance videos using poses as an intermediate representation. The proposed technique yields remarkably compelling results as demonstrated in accompanying videos. Moreover, it addresses concerns about synthetic content detection with its forensics tool and promotes further research with its open-source dataset. Overall, this paper offers a valuable contribution to computer vision and video processing with its innovative method for "do as I do" motion transfer.

Created on 25 Aug. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Similar papers summarized with our AI tools

59.6%

Large-Scale Multi-Character Interaction Synthesis

cs.GR

58.3%

Ergonomic-driven Geometric Exploration and Reshaping

cs.GR

56.9%

Deep Detail Enhancement for Any Garment

cs.GR

55.7%

Per Garment Capture and Synthesis for Real-time Virtual Try-on

cs.GR

52.9%

3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes

cs.GR

51.5%

Iterative $α$-(de)Blending: a Minimalist Deterministic Diffusion Model

cs.GR

50.7%

3D Gaussian Splatting for Real-Time Radiance Field Rendering

cs.GR

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.