FEATURE OF GENERATIVE ARTIFICIAL INTELLIGENCE
The last decade has brought to the attention of the general public the revolutionary realm of Artificial Intelligence (AI).
If interested, a deeper understanding in the subject of Artificial General Intelligence (AGI), and Artificial Super Intelligence (ASI) refer to this article that elaborates on these, and an opinion on their social influence going forward:
https://www.publish0x.com/naykdhodlr/artificial-intelligence-to-be-feared-or-not-xmyxwkp
The most common available for public use, a language model AI, produced by Open AI known as Chat GPT. Since its introduction, has passed through several iterations each gaining a larger 'cloud' of universal knowledge fueling its algorithm to respond to `prompt' queries put to its resolve.
The subject to this post is related to Generative Artificial Intelligence (GAI), the progression of GAI's capabilities in the realm of ART in all its format, medium forms: graphics, photography, visual arts, videography, etc...
GAI specifically, a Text-to-Image Synthesis Model is characterized by its ability to generate original images from natural language descriptions, or prompts.
The following lists the latest Text-to-Image Synthesis Model:
- BiLSTMS on color generation: A model specifically designed for color generation, using Bidirectional Long Short-Term Memory (BiLSTM) networks.
- alignDRAW (2015): One of the first modern text-to-image models, extending the DRAW architecture with a recurrent variational autoencoder and attention mechanism.
- StackGAN (2016): A generative adversarial network (GAN) architecture for text-to-image synthesis, combining the strengths of ProGAN and StackGAN.
- AttnGAN (proposed): An Attentional Generative Adversarial Network for fine-grained text-to-image generation, utilizing attention-driven, multi-stage refinement.
- Stable Diffusion (2022): A state-of-the-art text-to-image model, part of the Stable Diffusion architecture, capable of generating high-quality images from text descriptions.
- DALL-E 2 (2022): A text-to-image model developed by OpenAI, known for its ability to generate photorealistic images from text prompts.
- Imagen (2022): A text-to-image model developed by Google Brain, capable of generating high-quality images from text descriptions.
- Midjourney (2022): A text-to-image model known for its ability to generate artistic and photorealistic images from text prompts.
- Dzine (formerly Stylar.ai): A text-to-image model known for its ability to generate artistic and photorealistic and video images from text prompts.
- Leonardo: A text-to-image model known for its ability to generate artistic and photorealistic images from text prompts.
These models have been developed and refined over the years, with some achieving state-of-the-art results in text-to-image synthesis tasks. They utilize various techniques, such as attention mechanisms, generative adversarial networks, and diffusion models, to generate high-quality images from text descriptions a.k.a.: prompt.
Being keen to explore the capabilities of these various Models, a chance given by short-term free access to challenge their capacities; the following are image samples from four of those listed above:
DALLE-2.ai Prompt: 'a decades of humanity's existence pass into millennia, unfortunately, it's collective intelligence is superseded by its demonstrative stupidity.'
MIDJOURNEY.ai Prompt: 'Star Wars prince of evil advancing a realm of violent fear'
LEONARDO.ai Prompt: 'Create the image of a unique automobile design incorporating the iconic features of both Porsche and Ferrari'
Dzine.ai In this instance rather than exploring what could be `prompt' generated, is to take an image from Midjourney and Leonard, to measure its ability to enhance and AI prompt generated image:
Midjourney.ai:
Dzine.ai:
LEONARDO.ai:
Dzine.ai:
As can be observed the clarity, alteration and realism from one to another AI model by comparison is much improved.
To illustrate the opportunity the Dzine.ai model affords the enhancement of photographed images and have them illustratively, creatively altered is astonishing to say the least:
Photograph: Street Graffiti Art
Dzine.ai: Revision
These are just few of the many images either uniquely created from a worded prompt, alteration or variation from an existing image be it an AI generated, or original photograph. Dzine.ai too, has the opportunity to create a video stream generated from a worded prompt as yet to be explored.
For the community of Artists of all genre, and medium; these generational tools are both a curse and a blessing for reasons that are self-evident. That said, the challenge is to look for new opportunities to exploit the advantages Generative Artificial Intelligence affords going forward as it obviously, is just the beginning.