UX Collective

We believe designers are thinkers as much as they are makers. https://linktr.ee/uxc

Follow publication

The state of creative AI: will video producers/editors get superpowers?

Johan van Mol
UX Collective
Published in
8 min readJan 25, 2022

Cameraman with green lines overlay
Photo by Kyle Loftus. Edited by author with MediaPipe Pose Estimation.

AI Experiments In Video Production

Next-level color grading — a.k.a. style transfer

GTA is notorious for violence and purposeful accidents. The photorealism style makes this even scarier.

Shoot the foreground, swap the background

Video showing the swapping process
Video of the swapping process. Image from project page
Video showing the original and the swapped background
Video showing the original and the swapped background. Image from project page video.

If George Lucas decided to move an epic battle from the ice planet Hoth to the deserts of Tatooine, it could be done. It would also allow producers to shoot in suboptimal circumstances and fix it “in post”.

The best of both worlds: Combining CGI with AI

Mocap for the masses

Video showing a football player with a 3D Mocap figure overlayed on top.
Image from the Nvidia blog

Synthetic dance floor

Still a long way off

We will see more AI-driven VFX developments in the next few years, but there are still many obstacles to overcome: first, we need to achieve production-ready quality and then we need to integrate them into the production pipeline seamlessly.

Synthetic Video Gaining Traction

Deep fear

However, there was no fake news explosion. Regular people were mostly using the app to slap their faces on popular movie scenes.

Dancing queen: deepfakes in entertainment

Synthetic video for production

Screenshot of the Synthesia platform showing a slide with a synthetic actor
Screenshot of the Synthesia platform. Image form Synthesia press kit

Using traditional techniques, it would require an actor, a producer, a studio, lighting, a camera, a microphone, shooting, editing, and a couple thousands of dollars to produce a 10-minute video. 10 minutes of synthetic video would cost you 30 USD and a couple of minutes.

Voice clones — a.k.a. deep fake voices

Authentic-sounding voice clones are a lot more difficult to produce than faces. Most voice clones suffer from metallic sound, caused by background noise and echo that is embedded in the voice clone.

After typing in the text in the Descript software, the voice clone reads it out loud. Video by author.

Use cases for synthetic video and voices

Will Video Producers And Editors Get Super-Powers?

What if we had Hollywood-level production capabilities? Small studios should ask themselves. And large studios should ask themselves, what if everyone had our production capabilities? How will we be able to make a difference?

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Written by Johan van Mol

Strategist, creative technologist, design thinker, entrepreneur, writer on technology and humans.

No responses yet

Write a response