Lost In Diffusion

    Lost in data (2024)

    This experiment started when I got my hands on some data (duh). The data in question is the catalogue from the DesignMuseumGent. Plowing through the carefully crafted descriptions and professional photos that come with it, I started by fine-tuning a flux.dev diffusion model on a subset of the data (chairs).

    I used a script to create a dataset for fine-tuning, matching each picture with its description and using 'llava:34b' to translate the Dutch descriptions into English, which suits the flux model better. After handpicking about 33 good image/text pairs, I started the fine-tuning.
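
A minimal sketch of what such a pairing script could look like. The record fields (`image`, `description`), the folder layout and the `translate` callable are assumptions for illustration, not the actual script or the museum's real data format. The translation step is passed in as a function, so any model can be plugged in.

```python
import json
import shutil
from pathlib import Path
from typing import Callable, Iterable


def build_dataset(
    records: Iterable[dict],
    image_dir: Path,
    out_dir: Path,
    translate: Callable[[str], str],
) -> list[dict]:
    """Pair each catalogue image with its translated description.

    Assumed (hypothetical) record shape:
    {"image": "chair.jpg", "description": "Dutch text"}
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    pairs = []
    for record in records:
        image_path = image_dir / record["image"]
        description = (record.get("description") or "").strip()
        # Skip records that cannot form a complete image/text pair.
        if not image_path.is_file() or not description:
            continue
        caption = translate(description)
        # Copy the image and write the caption next to it under the same
        # base name, a layout that several LoRA trainers can read.
        target_image = out_dir / image_path.name
        shutil.copy2(image_path, target_image)
        caption_path = out_dir / (image_path.stem + ".txt")
        caption_path.write_text(caption, encoding="utf-8")
        pairs.append({"image": str(target_image), "caption": caption})
    # Keep a manifest so the pairs can be reviewed and handpicked afterwards.
    manifest = json.dumps(pairs, indent=2, ensure_ascii=False)
    (out_dir / "manifest.json").write_text(manifest, encoding="utf-8")
    return pairs
```

Writing a manifest alongside the files makes the handpicking step easier: the pairs can be skimmed in one place before any of them go into fine-tuning.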

    Data in Diffusion

    I also used the image-to-text capabilities of the same model to generate a description of each image from the database, and then used these descriptions to generate an image again, with the LoRA applied to make it look like it could be part of the collection.
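
A rough sketch of the image-to-text step, assuming the model is served locally through Ollama (the 'llava:34b' tag looks like an Ollama model tag, but that is a guess). The endpoint URL, the prompt text and both function names are assumptions for illustration.

```python
import base64
import json
import urllib.request

# Assumption: a local Ollama server on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_describe_request(
    image_bytes: bytes,
    model: str = "llava:34b",
    prompt: str = "Describe this image.",
) -> dict:
    """Build the JSON payload for a single, non-streaming image description."""
    return {
        "model": model,
        "prompt": prompt,
        # Ollama expects images as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
    }


def describe_image(image_path: str, url: str = OLLAMA_URL) -> str:
    """Send one image to the model and return its text description."""
    with open(image_path, "rb") as f:
        payload = build_describe_request(f.read())
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    # Passing data makes this a POST request.
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

Calling `describe_image("chair.jpg")` would return the generated description as a string, provided the server is running and the model has been pulled.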

    Lost in Diffusion

    Some cherry-picked examples from the first batch. I'm not (yet) elaborating on the aspects that are (not) interesting, but it's a worthy experiment: what information is lost when you let a human describe an object from a picture and then use that description to let a text2img model generate an image? The flow is Text > Translated Text > Image.

    Schaalmodel van de zetel 'Capitello'

    picture of real object, photographed by a human
    image diffused using description translated from original by Llava:34b
    The 'Capitello' upholstered chair was designed by Studio65 in 1971, a famous design collective within the Italian anti-design movement of the 1960s. Inspired by classicist architecture, its defining elements include thrones, columns and capitals, all rendered in a playful Pop Art-like form. By deconstructing architectural elite symbols, Studio65 challenged established values and, more specifically, the rigid power position of industry. The actual inspiration for 'Capitello' was a photo of tourists resting on capitals and twisted columns at the Acropolis in Athens. This chair in the shape of an interrupted capital was developed for a competition organized by chemical company DuPont, looking for new applications for its innovative materials. 'Capitello' is entirely constructed from polyurethane foam, with the symmetrical curls of the capitals made using the same mould and then assembled, while the base was cut out from a block of polyurethane foam. It was first presented in 1972 at Eurodomus 4 in Turin, and there is a scale model on a 1:5 scale in the collection of Design Museum Gent.

    Losing my Diffusion

    For the next iteration, I used an image2text model to describe the original image and generated a new image from that description, so the flow is Image > Text > Image.
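
The two flows, Text > Translated Text > Image and Image > Text > Image, can be sketched as two small functions. The `translate`, `caption` and `render` callables are hypothetical stand-ins for the translation model, the img2txt model and the fine-tuned text2img model; nothing here is the actual pipeline code.

```python
from typing import Callable

TextFn = Callable[[str], str]       # text in, text out (translation)
CaptionFn = Callable[[bytes], str]  # image in, text out (img2txt)
RenderFn = Callable[[str], bytes]   # text in, image out (text2img)


def text_to_image_flow(description: str, translate: TextFn, render: RenderFn) -> dict:
    """Text > Translated Text > Image."""
    prompt = translate(description)
    # Keep the prompt with the image so it stays visible what the model was given.
    return {"prompt": prompt, "image": render(prompt)}


def image_to_image_flow(original: bytes, caption: CaptionFn, render: RenderFn) -> dict:
    """Image > Text > Image."""
    prompt = caption(original)
    return {"prompt": prompt, "image": render(prompt)}
```

Both flows end in the same `render` step, so the only difference between the generated images is where the text came from: a human curator or the img2txt model.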

    Zetel uit de eigen woning van de ontwerper

    picture of real object, photographed by a human
    image diffused using description translated from original by Llava:34b
    image diffused from img2txt description generated from original by Llava:34b
    The image shows a chair with an unusual design. It appears to be a mid-century modern piece, characterized by its simple yet bold lines and geometric shapes. The frame of the chair is constructed from wood in a warm brown tone, which contrasts nicely with the upholstered seat cushion that seems to be a textured fabric or leather.

    Eenpersoonszetel

    picture of real object, photographed by a human
    image diffused using description translated from original by Llava:34b
    image diffused from img2txt description generated from original by Llava:34b
    This is an image of a modern chair with a contemporary design. The chair features a curved seat and backrest, both upholstered in a dark fabric that appears to be velvet or a similar plush material. The armrests are angular and made of metal, providing a striking contrast to the softness of the upholstery. The frame is also metallic with a darker finish, possibly black, which complements the rest of the chair's design. The background of the image is neutral, highlighting the chair as the central focus. There are no visible texts or additional objects in the immediate vicinity of the chair. The style of the image is clean and professional, suitable for showcasing furniture design.

    Frame van de zetel 'LC93B'

    picture of real object, photographed by a human
    image diffused using description translated from original by Llava:34b
    image diffused from img2txt description generated from original by Llava:34b
    The image shows a contemporary wooden chair with a modern design. The chair features an asymmetrical backrest, with the right side appearing to be higher than the left, creating a distinctive profile that resembles a stylized letter 'U' or the outline of a human head from the back. The seat is flat and seems sturdy, suggesting comfort for sitting. The wood appears to be light in color, possibly birch or another pale wood, with visible grain patterns adding texture to the piece. The chair is situated against a plain wall which provides no distractions from the item itself, highlighting its design features. There are no visible texts or brands on the chair, and it stands alone without any additional context provided by surrounding objects or setting.
kaotec bv
Londenstraat 40
9000 Gent
Belgium
VAT BE0784540750