Coarse Is Better

(borretti.me)

92 points | by _dain_ 3 hours ago ago

47 comments

  • raincole 2 hours ago ago

    It's ridiculous lol.

    Midjourney is optimized for beautiful images, while Nano Banana is optimized for better prompt adherence and (more importantly) image editing. It should be obvious for anyone who spent 20 minutes trying out these models.

    If your goal is to replace human designers with cheaper options[0], Nano Banana / ChatGPT is indefinitely more useful than Midjourney. I'd argue Midjourney is completely useless except for social media clout or making concept art for experienced designers.

    [0]: A hideous goal, I know. But we shouldn't sugarcoat it: this is what underpin the whole AI scheme now.

  • airstrike 36 minutes ago ago

    I'm no image gen expert but these prompts are downright terrible even by my standards.

    Are you really complaining that ", from the British Museum." leads to it a painting in the actual British Museum? Just remove the sentence, and you'll be fine. Now good luck trying to make Midjourney place the image at the museum!

    I'm a paying MJ user and am impressed by Nano Banana. They're different models. They each serve their purpose.

    This analysis is just noise. Yawn.

    Ironically, even an LLM with its fake reasoning capabilities can point out the issue with the prompts if you ask it to critique this article.

  • dleeftink 21 minutes ago ago

    Eno applies:

    > It's the sound of failure: so much modern art is the sound of things going out of control, of a medium pushing to its limits and breaking apart. The distorted guitar sound is the sound of something too loud for the medium supposed to carry it. The blues singer with the cracked voice is the sound of an emotional cry too powerful for the throat that releases it. The excitement of grainy film, of bleached-out black and white, is the excitement of witnessing events too momentous for the medium assigned to record them.

  • andy99 an hour ago ago

    You’re definitely on to something, people wouldn’t criticize as much as they are otherwise, they’d ignore it.

    I think the whole point is that in optimizing for instruction following and boring realism we’ve lost what could have been some unique artistic elements of a new medium, but anyway.

  • pornel 2 hours ago ago

    The author is using special prompts exploiting flaws of the old models, and doesn't like that new models interpret the hacks literally instead.

    The new models have prompt adherence precise enough to distinguish what "British Museum" or "auction at Christie's" is from the art itself, instead of blending a bag of words together into a single vector and implicitly copying all of the features of all works containing "museum" or "ArtStation" in their description.

    • RHSeeger an hour ago ago

      The prompts bothered me a lot, too. I don't do a lot of work with AI, but

      > A painting sold at Sotheby's

      and

      > A painting in the style of something that would be sold at Sotheby's

      convey very different meaning (to me).

  • amram_art 14 minutes ago ago

    The problem is not in the image models rather the training data and its context. "British museum" for MJ is the image source, "British museum" is the setting for Nano Banana.

  • airza 3 hours ago ago

    Years of refinement on the taste of people with no taste has produced a model with no taste. Crazy

    • drob518 2 hours ago ago

      I tasted the model, but then I spit it right back out.

      • mcpeepants an hour ago ago

        they put a special coating on the model to discourage this behavior

  • spaceman_2020 2 hours ago ago

    While I don’t disagree with the author, these are simply two completely different tools with different use cases. Nano Banana Pro throws out fantastic images you can actually use in your marketing right away. It’s not an art tool - it’s a business tool

    As long as the older tools still exist to make art, I don’t see what the problem is. Use NBP to make your marketing pics, MJv2 for your art

  • TrueDuality 40 minutes ago ago

    I love the inherent wonder and joy in this post around the original images.

  • delis-thumbs-7e 32 minutes ago ago

    Just fucking by canvas, brushes and good quality oil paint. You need only five colours[1]. Cost you maybe 50-80 euros. And any mess you produce will give you more joy thanand shot produced by any clanker brain. Keep at it for few years, take evening classrs, look tutorials and you have learned yourself a skill. You can now travel to any majos art museum across the world and have a discussion with masters through their works hanging on the wall.

    And you will also see how fucking sad and inferior all these ai images are. Really, trust me, please. There is more to art than this. There is more to life.

    [1] https://www.youtube.com/watch?v=f7F67FsLaaY

  • only-one1701 an hour ago ago

    AI doesn’t make art. The OP is trying to fit the square peg of their intuitive understanding about the art creation process into the round hole of generating it via AI

    • jellyroll42 18 minutes ago ago

      Correct! The process and struggle of creation is a large part of what makes art art. Removing friction from the process makes something artless.

      • card_zero 7 minutes ago ago

        Yes, but: when I was young I used to love photorealism and hyperrealism, which is super-smooth-and-shiny art that conceals its process in order to awe simpletons. Then I bought an airbrush, and then true color computer graphics happened, and soon after that I began to appreciate brush strokes and the texture of pen marks and the idea of the personality of the artist's hand. But that doesn't mean the process-hiding stuff is non-art, or even bad art. What's wrong with creating an amazingly convincing illusion, wasn't that always the goal, historically? Also there are no prizes for effort, and if your artwork is only struggle, I don't want to see it. Unless you're really badass about it.

  • chrismsimpson an hour ago ago

    Is some kind of MoE or routing (but for image models obviously), depending on the prompt ask, a possible solve?

  • nickelpro 2 hours ago ago

    The author has succeeded only in arguing one meaningless image factory produces images they find more aesthetically pleasing than another meaningless image factory.

    The framing implies they understand little of art at all; beyond gurgling and clapping like a child at the colors and shapes they find most stimulating.

    • kakapo5672 an hour ago ago

      It seems we have found the One True Artist on this thread, the gatekeeper and judge for all that is worthy. Humble obedience in thy presence.

      • jellyroll42 11 minutes ago ago

        Someone pushing back against a provincial and fundamentally incorrect definition of art is not gatekeeping.

    • brantmv 2 hours ago ago

      Why say this in such a rude way?

      • dwb 2 hours ago ago

        Because powerful interests are trying to hijack human creative pursuits in the interest of profit. None of the images in the post are art.

    • cluckindan 2 hours ago ago

      Found the zealot.

      Is true art a hermetic endeavour which must be gate-kept to seal out the lesser folk?

      If so, then why lambast the lesser folk over their ignorance of the secret knowledge?

      • zelphirkalt 26 minutes ago ago

        I don't think it is some secret. There are many who say that art is not just a painting itself, but in the process of making it, and the motivation and goals behind it. Generative "AI" has none of that. It does not labor like a human would. It has no motivation, because it is not a thinking being. It has no intention in making a digital output. It just works. It has no meaning by the process of creating. Some Michelangelo working on something amazing for years, that's something that has meaning.

        It is also not inventive. It's rehashing and regurgitating. That point is a bit muddy, because many humans do that too. But ask a generative "AI" to make something better than what it has learned from and new, and you will probably be disappointed.

        I am not an art buff, but I can sort of see, why one wouldn't consider it proper art.

      • nativeit an hour ago ago

        > Is true art a hermetic endeavour which must be gate-kept to seal out the lesser folk?

        Kind of. If everyone on the planet can paint the Sistine Chapel’s ceiling, then it’s not anything special anymore is it? Especially if it reduces the process to asking the world’s most prolific counterfeit machine to do it for you.

        • graemefawcett 22 minutes ago ago

          Besides, if everyone could paint the Sisten Chapel, then we'd have works equivalent to the Sistene Chapell everywhere.

          Why is that a problem?

          That to me sounds like the opposite of a problem.

          Used effectively, these tools are elevators, enhancing the capabilities of everything they touch.

          Telling them to paint you a picture results in the word you envision.

          Painting a picture with them is how you see mine

        • graemefawcett 37 minutes ago ago

          Is art then just the outcome? The artifact that was produced?

          What's your criteria then for who is allowed to produce art? If allowing everyone to create it lessens its value such that it becomes worthless, there must be a cutoff.

          If your goal is to ensure the continuity of human expression, limiting who is allowed to create art and narrowly defining art to great works kind of misses the point.

      • only-one1701 an hour ago ago

        People are aren’t entitled to get entry into every space they want to with no effort!

    • andy99 an hour ago ago

      Well, birthdays are merely symbolic of how another year's gone by and how little we've grown. No matter how desperate we are that someday a better self will emerge, with each flicker of the candles on the cake we know it's not to be. That for the rest of our sad, wretched, pathetic lives, this is who we are to the bitter end. Inevitably, irrevocably. Happy birthday? No such thing.

  • smurda 39 minutes ago ago

    Another word for coarse is impasto technique, where the paint is so thick the painting-knife or brush strokes are visible and leave a pronounced texture (e.g. Van Gogh, Rembrandt).

    Another cool prompt could be specific painting techniques (e.g. pencil shading, glaze) as if you were training an actual artist in a specific technique.

  • BoredPositron 28 minutes ago ago

    The OP would likely prefer Disco Diffusion if they want their art to remain coarse. Modern models possess advanced spatial understanding and adhere strictly to prompts, whereas the OP is using unstructured inputs better suited for older models with CLIP or T5 encoders that lack that spatial awareness. These legacy prompting styles are incompatible with Gen3 models that utilize VLMs as text encoders. If the OP wants to explore modern architecture, they should use Flux.2 with a LoRA or perhaps a coarser model like Zit if they prefer to rely solely on text conditioning. Nano Banana Pro requires extremely long and distinctive prompting to achieve specific aesthetics. His blog post shows a lack of understanding and a lack of adaption to modern architecture which would be fine if it wasn't that dismissive.

  • Demiurge 2 hours ago ago

    I don’t see splashes of primary color as more artistic. Anyway, what if you just ask it “more coarse”? I see impressive depth in the latest outputs, but as with all technically proficient performers, you might just have to consciously scale it back.

  • Zak 2 hours ago ago

    The author claims the old models are better at creating art than the new ones. I disagree; art requires consciousness and intent while this type of model is capable of neither.

    • CuriouslyC 42 minutes ago ago

      I'm pretty sure people have created images via random physical processes, then selected the best ones, and people have called it "art." That's no different than cherry picking AI generated images that resonate. The only difference is the anti-generative AI crusade being spearheaded by gatekeepers who want to keep their technical skills scarce in their own interests.

      • zelphirkalt 17 minutes ago ago

        I think one could still point out a little difference: Random physical processes do usually not involve mix and matching millions of other people's works. Instead, something new in every aspect and its origin can emerge.

        It feels like AI art is often just a version of: "I take all the things and mix them! You can't tell which original work that tree is taken from! Tiihiiihi!"

        Where "tree" stands for any aspect of arbitrary size. The relationship is not that direct, of course, because all the works gen AI learns from kind of gets mixed in the weights of edges in the ANN. Nevertheless, the output is still some kind of mix of the stuff it learned from, even if it is not necessarily recognizable as such any longer. It is in the nature of how these things work.

    • LatencyKills 2 hours ago ago

      I define art as something that evokes an emotion or feeling. I’ve seen people wax poetic about the ”meaning” of an imagine only to find out that the image was created synthetically.

      Were those “feelings” not authentic?

      • zelphirkalt 13 minutes ago ago

        I don't think it is about the feelings or emotions evoked in the observer. At least not in that generality. It only is, if there is an intention in the creating process of the art, that aims at evoking the emotions or feelings. Otherwise going by the more general definition, many everyday objects become art. Home becomes art. The way to the office becomes art, even if it completely sucks.

      • neonnoodle 2 hours ago ago

        If I see a cloud in the shape of my childhood dog and start to cry, is the cloud art?

        • rtldg 2 hours ago ago

          Yes. The Earth and its formations are art. I disagree that art requires consciousness and intent, but those admittedly do improve its value [to me]. (For reference, I value AI content/art poorly and avoid it)

          • only-one1701 an hour ago ago

            Everything is art, fantastic. I see nothing wrong with this definition.

            • card_zero an hour ago ago

              We have at least established that very boring pieces, such as Andy Warhol's Empire, Kazimir Malevich's White on White, and John Cage's As Slow As Possible, are not art.

              • only-one1701 20 minutes ago ago

                Bad code is still code. A painting of code is not code.

      • only-one1701 an hour ago ago

        Is a car crash art?

        • RHSeeger an hour ago ago

          A drawing/painting of a car crash certainly can be

          https://www.etsy.com/listing/4329570102/crash-impact-car-can...

          As can a photo of one (sorry, I don't have a good example of that).

          And, both a camera and AI are an example of "using a tool to create an image of something". Both involve a creator to determine what picture is created; but the tool is central/crucial to the creation.

          • only-one1701 15 minutes ago ago

            I would never argue that a painting of a car crash couldn’t be art. It’s funny your bringing up that a camera is a tool for creating art; I also hold photographic art in lower esteem than other kinds of visual art (though I still think some kind of photography can be art).

            At a certain point, we need to be realistic about the amount of effort involved in artistic creation. Here’s a thought experiment: someone puts two paintings in a photocopier and makes a single sheet of paper with both paintings. Did that person create art? They certainly had the vision to put those two specific paintings together, and they used a tool to create that vision in reality!

          • card_zero 33 minutes ago ago

            When I was about 12 a car crashed in my quiet street (somebody tried to drive it through a concrete fence), so the next day I sat in the street and did an ink drawing of the wreckage with a mapping pen nib. That was excellent art. Then I stole one of the gigantic suspension springs and took it home to use as a stool, which by some silly definitions was also an act of art. But this all evades the original question about whether the actual car crash is art for evoking feelings, or whether art in fact must involve pictures, or human communication, or what. It's one of the impossible definitions, along with "intelligence" and "freedom". I'm a fan of "I know it when I see it".

        • card_zero an hour ago ago

          Perhaps it has to be a more sophisticated emotion, such as feeling tired of a hackneyed definition.