X-Y plot of algorithmically-generated AI art of European-style castle in Japan demonstrating DDIM diffusion steps


Autor:
Přisuzování:
Obrázek je označen jako „Vyžadováno uvedení zdroje“ (Attribution Required), ale nebyly uvedeny žádné informace o přiřazení. Při použití šablony MediaWiki pro licence CC-BY byl pravděpodobně parametr atribuce vynechán. Autoři zde mohou najít příklad pro správné použití šablon.
Formát:
2560 x 1734 Pixel (7454406 Bytes)
Popis:

An X/Y plot of algorithmically-generated AI artworks depicting a European-style castle in Japan, created using the Stable Diffusion V1-5 AI diffusion model. This plot serves to demonstrate the U-Net denoising process, using the DDIM sampling method. Diffusion models algorithmically generate images by repeatedly removing Gaussian noise, step-by-step, and then decoding the denoised output into pixel space. Shown here are a smaller subset of steps within a 40-step generation process.

Procedure/Methodology

These images were generated using an NVIDIA RTX 4090; since Ada Lovelace chipsets (using compute capability 8.9, which requires CUDA 11.8) are not fully supported by the pyTorch dependency libraries currently used by Stable Diffusion, I've used a custom build of xformers, along with pyTorch cu116 and cuDNN v8.6, as a temporary workaround. Front-end used for the entire generation process is Stable Diffusion web UI created by AUTOMATIC1111.

A batch of 512x768 images were generated with txt2img using the following prompts:

Prompt: a (european castle:1.3) in japan. by Albert Bierstadt, ray traced, octane render, 8k

Negative prompt: None

Settings: Sampler: DDIM, CFG scale: 7, Size: 512x768

During the generation of this batch, the X/Y plot was generated using the "X/Y plot" txt2img script, along with the following settings:

  • X-axis: Steps: 1, 2, 3, 5, 8, 10, 15, 20, 30, 40
  • Y-axis: None
Komentář k Licence:
Output images

As the creator of the output images, I release this image under the licence displayed within the template below.

Stable Diffusion AI model

The Stable Diffusion AI model is released under the CreativeML OpenRAIL-M License, which "does not impose any restrictions on reuse, distribution, commercialization, adaptation" as long as the model is not being intentionally used to cause harm to individuals, for instance, to deliberately mislead or deceive, and the authors of the AI models claim no rights over any image outputs generated, as stipulated by the license.

Addendum on datasets used to teach AI neural networks
Artworks generated by Stable Diffusion are algorithmically created based on the AI diffusion model's neural network as a result of learning from various datasets; the algorithm does not use preexisting images from the dataset to create the new image. Ergo, generated artworks cannot be considered derivative works of components from within the original dataset, nor can any coincidental resemblance to any particular artist's drawing style fall foul of de minimis. While an artist can claim copyright over individual works, they cannot claim copyright over mere resemblance over an artistic drawing or painting style. In simpler terms, Vincent van Gogh can claim copyright to The Starry Night, however he cannot claim copyright to a picture of a T-34 tank painted with similar brushstroke styles as Gogh's The Starry Night created by someone else.
Licence:
Credit:
Vlastní dílo
Sdílet obrázek:
Facebook   Twitter   Pinterest   WhatsApp   Telegram   E-Mail
Více informací o licenci na obrázek naleznete zde. Poslední aktualizace: Wed, 14 Feb 2024 13:26:48 GMT

Relevantní obrázky

© VulcanSphere / CC BY 4.0
© VulcanSphere / CC BY 4.0
(c) Machine Vision and Learning Group, LMU Munich, MIT

Relevantní články

Stable Diffusion

Stable Diffusion je model hlubokého učení převádějící text na obraz, který byl uveden na trh v roce 2022 na základě techniky difúze. Je primárně určen k generování podrobných obrázků na základě popisů textu, ale lze jej také použít k dalším úkolům, jako je inpainting, outpainting a generování překladů obrazu k textovému zadání. Byl vyvinut výzkumníky z CompVis Group na Ludwig Maximilian University v Mnichově a Runway, s výpočetním příspěvkem od Stability AI a trénovacími daty poskytnutými neziskovými organizacemi. .. pokračovat ve čtení