The Free & Uncensored Version of MidJourney! (FLUX.1)

Name: The Free & Uncensored Version of MidJourney! (FLUX.1)
Uploaded: 2024-08-06T21:44:36.000Z
Duration: 36 min 25 s

Introduction to Flux One

Overview of the New AI Image Generating Tool

A new AI image generating tool called Flux One has emerged, claiming to rival and even outperform Mid Journey in certain aspects.

Developed by a team from Black Forest Labs, many members contributed to the creation of Stable Diffusion and other significant innovations in AI image generation.

Models Available in Flux One

Flux offers three models:

Flux One Schnell: Fastest model for local development; open-source under Apache 2.0 license.

Flux One Dev: Middle-tier model that is more efficient than Schnell but not for commercial use.

Flux One Pro: Top-tier model designed for enterprise solutions with state-of-the-art performance.

Using Flux Models

Accessing the Models

Users can access the models through various websites, including Hugging Face where both Schnell and Dev models are available for free.

The interface allows users to input prompts and adjust settings like random seed, height, and inference steps.

Building Workflows with Glyph

Glyph is a platform that enables users to create custom workflows using Flux models without cost.

Users can enhance their prompts using LLM tools like ChatGPT before generating images with either Flux Pro or Schnell.

Performance Insights from User Experience

Feedback on Model Capabilities

Insights were gathered from an experienced user named Miguel (Angry Penguin PNG), highlighting strengths and weaknesses of Flux.

Illustrating an Angry Penguin: A Comparison of AI Art Models

Exploring Different Artistic Styles

The speaker attempts to create a hand-drawn illustration of an angry penguin, noting that while the image looks good, it lacks the true hand-drawn quality.

An oil painting version is generated; although visually appealing, it does not convincingly represent an oil painting style.

A watercolor painting of the same subject is produced, which captures some watercolor elements but still doesn't fully embody the medium's characteristics.

In comparison with MidJourney's output for similar prompts, MidJourney's results appear more aligned with traditional artistic styles like watercolor and oil painting.

Evaluating Flux Model Capabilities

The speaker discusses Flux’s strengths in realism and aesthetic training, suggesting it produces high-quality outputs comparable to MidJourney when using appropriate prompts.

A photo-realistic image prompt generates a realistic depiction of a man eating ice cream on a city sidewalk, though some details may be misrepresented (e.g., sniffing instead of eating).

Another prompt for a woman taking a selfie on a tropical island yields decent results but lacks ultra-realism due to potential prompt optimization issues.

Simplifying Image Generation Workflows

The speaker demonstrates creating a basic image generator workflow using Flux Pro without optimizing prompts.

A portrait prompt generates an image that appears realistic but falls short compared to MidJourney in terms of realism and context (e.g., "about to give a meeting" isn't visually represented).

Strength in Text-Based Creations

Flux excels at generating images involving text elements such as logos or memes; examples include humorous depictions like George Washington crossing the Delaware with modern twists.

A polar bear holding a sign successfully captures the intended message on the first attempt, showcasing strong adherence to text prompts.

Challenges with Longer Text Prompts

When testing longer text prompts (e.g., plane writing "subscribed"), results vary; initial attempts yield partial success but miss complete adherence to longer phrases.

Subsequent trials show improvement in visual representation but still struggle with capturing all requested text accurately.

Prompt Adherence Comparisons

The discussion highlights how different models handle complex prompts. For instance, MidJourney often misses multiple elements from intricate requests.

While Flux shows promise in adhering closely to detailed prompts (like depicting specific scenarios), its performance remains variable compared to competitors like DALL-E 3 and MidJourney.

AI Art Generation: A Comparison of Models

Overview of Dragon Imagery

The speaker discusses various dragon images, highlighting a normal dragon in cowboy boots eating chips and comparing it to more complex prompts.

Demonstrates the capabilities of Dolly 3, which successfully generates a three-headed dragon eating nachos while wearing cowboy boots and watching TV, showcasing its prompt adherence.

Limitations and Capabilities of AI Models

Mentions that while current models cannot generate not safe for work content, they can theoretically create copyrighted images. An example is given with SpongeBob SquarePants high-fiving Super Mario.

Tests generating celebrity images like Tom Hanks hugging Kanye West; results show some discrepancies but overall recognizable outputs.

Prompting Techniques for Better Outputs

Emphasizes the flexibility of generating various imaginative scenarios without restrictions (e.g., Donald Trump eating a donut).

Discusses methods to enhance prompts using AI tools or specific keywords (like "4K" or "HD") to improve image quality.

Advanced Prompting Strategies

Suggests being as detailed as possible in prompts for better results; mentions users pushing boundaries with creative descriptions.

Highlights an example where typography creatively forms the word "cat," demonstrating the potential of imaginative prompting.

Future Developments in AI Art Generation

Introduces Flux One's upcoming text-to-video capabilities, indicating a shift towards more dynamic content generation.

Compares Flux One's performance against Mid Journey and Dolly 3; notes that while improvements are evident, it hasn't surpassed them yet.

Conclusion on Model Performance and Potential

Concludes that Flux One shows promise by combining strengths from existing models but still has room for improvement in realism and prompt adherence.

Predicts that as Flux evolves, it could integrate features from all leading models, making it a versatile tool for creators.

Notes the open-source nature of Flux will allow developers to refine it further, enhancing its capabilities over time.

New Open Source Model Competing with Stable Diffusion

Excitement for New Developments

The speaker expresses enthusiasm about a new open-source model that appears to outperform Stable Diffusion, indicating a competitive landscape in AI tools.

A follow-up video is planned to share further insights and improvements related to the new model as the speaker learns more.

Acknowledgments and Resources

Gratitude is extended to Miguel, also known as Angry Penguin, for assistance in creating the video and explaining the capabilities of Flux.