Man simulates time travel thanks to Stable Diffusion image synthesis

By | December 20, 2022
An AI-generated image of a fictional time traveler named Stelfie during the construction of the pyramids.
Enlarge / An AI-generated image of a fictional time traveler named Stelfie during the construction of the pyramids.

Throughout December, a social media user known as Stelfie the Time Traveller has been crafting a time-hopping travelogue using generative AI. Thanks to Stable Diffusion and fine-tuning, an anonymous artist has created a fictional photorealistic character that he can insert into faux historical photographs set in different eras, such as ancient Egypt or the time of the dinosaurs.

Stable Diffusion is a deep learning image synthesis model that allows people to create fictional scenes using text descriptions called prompts. With an additional technique called Dreambooth, people can insert their own subject or character into scenes generated by Stable Diffusion. It can also be used to insert real people into fictional situations.

So far, “Stelfie” has taken historical selfies during the ice age (being chased by a woolly mammoth), in ancient Egypt (during the construction of the pyramids), in ancient Greece (with the Trojan Horse), hanging out with Leonardo da Vinci (while creating the Mona Lisa), in the old West, while running from a tyrannosaurus rex, and while sailing with Christopher Columbus.

In Log No. 4, Stelfie visited Ulysses and the Trojan Horse.
Enlarge / In Log No. 4, Stelfie visited Ulysses and the Trojan Horse.

The artist behind Stelfie writes social media posts (on Twitter, Instagram, and Reddit) in character as playful dispatches from a 41-year-old time traveler as he visits different locations, for example:

Log #3 – Columbus ship, 1492 DC. “Laaaaand” Now, I don’t wanna brag but… After months of sailing Columbus was lost in the Oceans and well let’s say I may have kinda suggested the correct direction…

The anonymous artist (a self-identified “funny old man”) detailed some of the process he uses to create the images in several Reddit comments: a combination of Stable Diffusion 1.5, a custom AI model for the landscape, and a custom AI model trained on the Stelfie face, which is apparently a fictional person created using Character Creator. He uses “a lot of inpainting,” which means inserting AI-generated imagery into the images to fix errors and sculpt the scene, and each image takes three hours to create.

Some may find Stelfie’s attempts to sell the images as NFTs unpalatable (the tech is controversial), but simulated time travel is arguably a novel entertainment use of technology that can otherwise create deepfakes and potentially deceive others. Using Dreambooth and Stable Diffusion, anyone can now insert a character (or themselves) into fantastic and fictional situations, so we’ll likely see a lot more fictional entertainment scenarios in the near future.

Listing image by StelfieTT

Source