Why it matters: Researchers continue to find new ways to leverage artificial intelligence and machine learning capabilities as the technologies evolve. Earlier this week, Google scientists announced the creation of Transframer, a new framework that can generate short videos from a single image input. The new technology could someday augment traditional rendering solutions, allowing developers to create virtual environments using machine learning capabilities.
The new framework’s name (and, in some ways, its concept) is a nod to another AI-based model known as Transformer. Originally introduced in 2017, Transformer is a neural network architecture that processes text by modeling the relationships between all the words in a sentence, a mechanism known as self-attention. The architecture has since been included in standard deep learning frameworks such as TensorFlow and PyTorch.
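The core idea behind Transformer, relating every word to every other word, can be illustrated with a stripped-down self-attention step. This is a toy sketch, not the full architecture: real Transformers use learned query, key, and value projections, multiple heads, and many stacked layers, all of which are omitted here.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X):
    """Simplified single-head self-attention: every token attends to
    every other token, weighted by similarity (illustrative only)."""
    scores = X @ X.T / np.sqrt(X.shape[-1])  # pairwise similarity scores
    weights = softmax(scores, axis=-1)       # each row sums to 1
    return weights @ X                       # context-mixed representations

# Four toy "word" embeddings of dimension 3.
X = np.array([[1.0, 0.0, 0.0],
              [0.0, 1.0, 0.0],
              [1.0, 1.0, 0.0],
              [0.0, 0.0, 1.0]])
out = self_attention(X)
print(out.shape)  # one context-aware vector per word
```

Each output row blends the input vectors according to how strongly that word "attends" to the others, which is what lets the model weigh a word's meaning in context.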
Just as Transformer uses language context to predict likely outputs, Transframer uses context images with similar attributes, in conjunction with a query annotation, to create short videos. The resulting videos move around the target image and render accurate perspectives despite receiving no geometric data in the original image inputs.
Transframer is a general-purpose generative framework that can handle many image and video tasks in a probabilistic setting. New work shows it excels in video prediction and view synthesis, and can generate 30s videos from a single image: https://t.co/wX3nrrYEEa 1/ pic.twitter.com/gQk6f9nZyg
— DeepMind (@DeepMind) August 15, 2022
The new technology, demonstrated using Google’s DeepMind AI platform, functions by analyzing a single context image to extract key pieces of image data and generate additional images. During this analysis, the system identifies the picture’s framing, which in turn helps it predict the picture’s surroundings.
The context images are then used to predict how the scene would appear from different angles. The model estimates the probability of additional image frames based on the data, annotations, and any other information available from the context frames.
The framework marks a notable step in video technology, generating reasonably accurate video from a very limited set of data. Transframer has also shown promising results on related benchmarks such as semantic segmentation, image classification, and optical flow prediction.
The implications for video-based industries, such as game development, are potentially huge. Current game development environments rely on core rendering techniques such as shading, texture mapping, depth of field, and ray tracing. Technologies such as Transframer could offer developers a completely new development path, using AI and machine learning to build environments while reducing the time, resources, and effort needed to create them.
Image credit: DeepMind