#图像生成

3周前

这种图用什么生成的

#图像生成 #AI #生成技术 #创意设计

4个月前

为什么 nano banana pro 的文字渲染和指令跟随，有如此大的进步？以配图为例，分别是大量汉字的 PPT 和知识漫画，这种级别的文字生成，背后的 why，是真正有趣的地方。 === 对于扩散模型（diffusion model），生成图片的本质是去噪（denoising）。模型学习的是图像在潜空间（latent space）里的概率分布，通过预测逐渐减去噪声，逐步让图像“显形”，把一张图片“画”/“雕刻”出来。 dall·e 2和3、stable diffusion 等都是如此。它们本质上是没文化的“画图机器”，处理的是连续的像素值。对于它们来说，图片里的文字只是像素，就像不懂中文的老外，生成的汉字看起来“像”，但笔画等细节是乱七八糟的，因为它根本不“懂”这些汉字。但是，对于 nano banana pro 和 gpt-image-1 这样的原生多模态模型（token in, token out 的 native multimodal transformer），生成图片的本质已经不再是“绘画”，而是“预测下一个token”。对nano banana pro 来说，画面里的汉字，不论是图片还是文字，实际上是同样的数学向量，是“跨模态”的。它生成的汉字，是在用写文字的逻辑，在“写”图片里的汉字，所以拼写错误率极低。

#Nano Banana Pro #文字渲染 #多模态模型 #token预测 #图像生成

4个月前

试了下 Z-Image-Turbo 图1-2，4张选最好的1张展示在下方 Z-Image-Turbo 速度生成很快，小字文字渲染一般图3-4是 Nanobanana Pro 生成的效果两者对比，是有点欺负 Z-Image-Turbo 了，Nanobanana Pro 价格是 Z-Image-Turbo 价格的 30倍

#Z-Image-Turbo #nanobanana Pro #图像生成 #性价比 #技术评测

神奇小喷菇AIGC

4个月前

nano banana pro 风靡全球大家停不下来生成图像层出不穷的想象力以下是6个商业应用方向的示例👇

nanobanana平台助力个人形象照生成，专业形象照引发热议· 107 条信息

#Nano Banana Pro #图像生成 #想象力 #商业应用 #积极

科技混子Chris

4个月前

📌 Nano Banana 6 个免费入口 · 速查版（收藏不迷路） 1️⃣ Google AI Studio（官方最稳） 🔗 右上角：选择 Nano Banana 输入提示词 / 上传参考图 ⭐ 官方最高质量入口 2️⃣ Gemini 🔗 模型切换：Gemini 2.5 Flash 模式：Create images ⭐ 出图快，可写文案+图一起生成 3️⃣ LMArena 🔗 顶部：Direct Chat 模式：image（自动调用 Nano Banana） ⭐ 最无脑入口 4️⃣ Lovart 🔗 点击：新建项目右侧模型：Nano Banana ⭐ 适合项目式大量出图 5️⃣ Flowith（送 1000 积分） 🔗 选：图片/视频生成模式模型：Gemini 2.5 Flash Image ⭐ 每月免费 ≈ 33 张 6️⃣ OpenRouter 🔗 模型：Google: Gemini 2.5 Flash Image Preview 点击：Chat 即可使用 ⭐ 多模型对比必备

nanobanana平台助力个人形象照生成，专业形象照引发热议· 107 条信息

#nano banana #Gemini 2.5 Flash #AI Studio #图像生成 #免费入口

小樱💞｜实用工具分享

4个月前

🚀 Nano Banana 2.0 这哪里是升级，这是掀桌子！真的做到了：一图生世界我生成的 18 个绝佳案例 ⬇️ 🏗️ 建筑图 → 实物渲染图 👗 小红书穿分析图 📷 照片光影调节 📐 股定理动图 / 原理解说图 📱 产品架构图…… 全部收录在：

nanobanana平台助力个人形象照生成，专业形象照引发热议· 107 条信息

#Nano Banana 2.0 #AI #图像生成 #升级 #案例

歸藏(guizang.ai)

4个月前

Lovart 和 Nano Banana Pro 在复杂图像任务上省事很多啊比如真实图片和动漫人物混合生成、今天搞的从地址生成真人打卡照都能一次出而且我还在 Lovart 上搞出了不输 NotebookLM 的 PPT 生成提示词！他们目前 Lovart 免费，顺便教一下咋用就不会误操作消耗积分 👇是提示词和教程：

nanobanana平台助力个人形象照生成，专业形象照引发热议· 107 条信息

#Lovart #Nano Banana Pro #图像生成 #免费 #PPT生成

4个月前

すごいインプ、、、みんなもやりたかったんですね！プロンプト置いておきます。お好きなキャラクターでどうぞ(少し実写になりがちかも) ---------------------- Based on the uploaded reference character, generate a live-action scene inside a wide Tokyo girl’s apartment — a bright, lived-in one-room that feels almost 1LDK in scale. White walls, warm wooden floor, beige curtains, a low bed with soft bedding, a desk with cosmetics, bookshelves, plants, a standing mirror, a rug, scattered personal items, and a compact kitchen area in the back. The room must have strong depth with clear foreground, mid-ground, and deep background layers. Place around thirty identical versions of the same character (same face, hairstyle, outfit as the reference) throughout the room, each in a different action or interaction. Vary distance, scale, height, and visibility so the density feels natural. Foreground (very close to camera / partial occlusion): - characters walking past the lens, slightly out of focus - a hand or shoulder entering frame - one leaning close toward the camera - one half-visible behind a large plant - one sitting directly in front, tying her hair - one kneeling by the desk adjusting objects Mid-ground (main room area): - one stretching beside the bed - one sitting on the bed checking her phone - one lying belly-down across the bed - one reaching under the bed - one organizing cosmetics on the desk - one flipping through a book on the shelf - one standing in front of the mirror - one crouching on the rug - one leaning against the wall - one looking out the window - one adjusting the curtain - one carrying laundry - one drinking from a cup - one tidying pillows - one sitting on the floor eating snacks - one doing a small jump or motion blur gesture - one moving a small chair Background (deep perspective / near kitchen and hallway): - one standing near the stove drinking water - one opening a cabinet - one sitting on a stool - one leaning in a doorway - one walking toward the hall - one silhouette partly hidden behind the fridge - one reaching up to a high shelf - one standing far by the entrance area - one barely visible through the hallway frame - one sitting on the floor near the kitchen rug Ensure strong layered occlusion: foreground characters partially block mid-ground ones, and background characters appear smaller with natural perspective falloff. Scatter the thirty characters organically, avoiding symmetry or grid alignment. Lighting is soft natural daylight from the window, consistent across all characters for full integration. Placed within a live-action background that matches the illustration’s posture and composition — while faithfully preserving the illustrated texture and style. Realistic lighting, depth of field, and subtle filming effects are applied to blend the illustration seamlessly with the real environment.

AI视频井喷：Midjourney领跑，多模态混战· 337 条信息

#AI #图像生成 #东京女孩公寓 #角色扮演 #实景融合

4个月前

另一个 Gemini Nano Banana Pro 仍然搞不定的问题是复杂的光学比如下面这个 prompt：画一个玻璃酒杯，里面有小半杯红酒。一个年轻女性端着酒杯凝视着红酒，酒杯的杯壁上倒映出女性的脸。感觉需要世界模型的突破才行。

#Gemini Nano #光学问题 #AI局限性 #世界模型 #图像生成

4个月前

我写篇文章要做一些配图，用Nano Banana Pro。突然想到 NewYorker封面的风格，结果它真的真的给我输出了。疯了，疯了。

nanobanana平台助力个人形象照生成，专业形象照引发热议· 107 条信息

#Nano Banana Pro #NewYorker封面 #AI #图像生成 #积极

4个月前

这张图是通过1500个字的提示词绘制的 Nano Banana 2 太强了。。。。提示词见回复

nanobanana平台助力个人形象照生成，专业形象照引发热议· 107 条信息

#AI绘画 #Nano Banana 2 #提示词 #图像生成 #技术惊叹

4个月前

炸裂！Nano Banana Pro的中文生成能力有多强？现在Nano Banana Pro中文生成能力已经不是缺点了，甚至是优势了。图一：给这句古诗词配图“落霞与孤鹜齐飞，秋水共长天一色”。图二：生成蜡笔小新和小白在《清明上河图》的一角卖大福的场景图三：生成孙悟空和林黛玉的合照。图四：大漠孤烟直，长河落日圆。给这句诗配图

nanobanana平台助力个人形象照生成，专业形象照引发热议· 107 条信息

#Nano Banana Pro #中文生成 #古诗词配图 #AI #图像生成

Jesse Lau 遁一子

4个月前

随手用ai studio build一个app，输入一个图片，生成2张自然头像用我的id图片测试，最开始生成了一个老外，增加一下提示好了。

#AI Studio #图像生成 #自然头像 #AI头像生成 #提示词调整

UNICORN⚡️🦄

5个月前

我又在玩AI修改图片风格就一起把12种主要风格全部整理出来了给AI一张图，再给下面提示词，就OK了，通用 🎨 一、动画与插画风格 1. 吉卜力风格（Studio Ghibli Style）关键词： Studio Ghibli style, soft lighting, lush nature, watercolor texture, hand-drawn, warm tone, nostalgic atmosphere 增强词： Hayao Miyazaki, fantasy village, gentle wind, sunlight through trees, detailed background 适用场景：自然场景、童话故事、治愈系插画 2. 迪士尼风格（Disney Style）关键词： Disney animation style, expressive characters, cinematic lighting, vibrant colors, detailed hair and fabric 增强词： Pixar 3D render, soft glow, fairytale lighting, emotional expressions 适用场景：角色肖像、动画剧照风格 3. 日式动漫风格（Anime Style）关键词： Japanese anime style, clean lines, flat shading, big eyes, dynamic pose, colorful background 增强词： Makoto Shinkai lighting, sunset sky, cherry blossoms, school uniform 适用场景：动漫人物、插画、轻小说封面 4. 西方漫画风格（Comic Book Style）关键词： Western comic style, bold lines, halftone texture, strong contrast, dynamic action 增强词： Marvel, DC, superhero, exaggerated anatomy, dramatic composition 适用场景：英雄主题、动作画面 🖌️ 二、经典绘画风格 1. 印象派（Impressionism）关键词： Impressionist painting, visible brushstrokes, light reflections, pastel colors, soft focus 增强词： Claude Monet, morning light, water reflections, garden scenery 适用场景：风景画、自然主题 2. 超现实主义（Surrealism）关键词： Surrealist painting, dreamlike, symbolic composition, floating objects, distorted perspective 增强词： Salvador Dali, melting clock, endless desert, subconscious imagery 适用场景：概念艺术、象征性创作 3. 波普艺术（Pop Art）关键词： Pop art, vibrant colors, bold outlines, comic dots, repetition, 1960s aesthetic 增强词： Andy Warhol, Roy Lichtenstein, retro advertising, speech bubbles 适用场景：平面设计、潮流插画 4. 立体主义（Cubism）关键词： Cubism style, geometric abstraction, fragmented shapes, muted palette 增强词： Pablo Picasso, multiple perspectives, analytical geometry 适用场景：抽象艺术、海报设计 💫 三、现代与幻想风格 1. 赛博朋克（Cyberpunk）关键词： Cyberpunk city, neon lights, rainy street, reflections, futuristic outfit, holographic ads 增强词： Blade Runner style, cybernetic implants, night city, blue and pink lighting 适用场景：城市夜景、未来角色设定 2. 蒸汽朋克（Steampunk）关键词： Steampunk design, brass and copper, Victorian fashion, gears and steam, mechanical wings 增强词： retro machinery, skyship, London fog, fantasy engineer 适用场景：机械幻想、角色设定 3. 梦幻少女风（Fantasy Girl / Lolita / Pastel Style）关键词： dreamy pastel colors, elegant dress, lace, soft glow, kawaii, magical atmosphere 增强词： Lolita fashion, fairy dust, gentle eyes, sakura petals 适用场景：角色肖像、浪漫插画 4. 像素艺术 / 低多边形（Pixel / Low-poly）关键词： Pixel art, 8-bit style, retro game aesthetic, blocky texture 或 Low-poly art, geometric simplicity, flat shading, minimalist 3D 适用场景：游戏设计、复古风 5. AI 混合新媒体风格（AI-Generated Hybrid Style）关键词： AI art style, mixed media, abstract texture, photo-realistic + painterly fusion, glitch aesthetic 增强词： dreamcore, vaporwave, neural style transfer, surreal composition 适用场景：实验艺术、视觉概念、封面设计

AI视频井喷：Midjourney领跑，多模态混战· 337 条信息

#AI绘画 #风格迁移 #艺术风格 #提示词 #图像生成

5个月前

喜欢和菜头最近文章的配图风格，参考样式让 Gemini 生成了几张

Google Gemini 2.5发布引发AI模型性价比热议· 475 条信息

OpenAI新德里发布会：ChatGPT语音翻译功能引发热议· 869 条信息

#和菜头 #文章配图 #Gemini #图像生成 #风格参考

5个月前

我现在可以确定了，只要穿着衣服，Grok Imagine可以让人什么动作就什么动作，它还帮你自动匹配表情。

#Grok Imagine #动作 #表情 #AI #图像生成

5个月前

一张图生成AI女友，动作表情都可圈可点。改天写个教程。

AI视频井喷：Midjourney领跑，多模态混战· 337 条信息

#AI女友 #人工智能 #图像生成 #教程 #科技

5个月前

Google AI Plus 限时优惠新订阅者前6个月，可享5折优惠 Google AI Plus 包含更高的图像生成和编辑模型 Nano Banana 的使用限额，以及在 Gemini 应用、Flow 和 Whisk 中更多地使用视频生成功能。还可以获得 Gmail 和 Docs 等应用中的内置 Gemini使用， NotebookLM 访问权限，200 GB 存储空间等。

Google Gemini 2.5发布引发AI模型性价比热议· 475 条信息

OpenAI新德里发布会：ChatGPT语音翻译功能引发热议· 869 条信息

#Google AI Plus #限时优惠 #图像生成 #Gemini #NotebookLM

5个月前

接近复刻了，换了 Midjourney 模型后直出 ⬇️

#midjourney #AI模型 #图像生成 #复刻 #技术

5个月前

Sora2想看什么自己生成

AI视频井喷：Midjourney领跑，多模态混战· 337 条信息

#Sora2 #AI #文本生成 #图像生成 #科技

6个月前

Sora 2来了！

#Sora 2 #人工智能 #科技 #创新 #图像生成

𝔽𝕣𝕠𝕤𝕥 𝕄𝕚𝕟𝕘

6个月前

挺准的，等会，我也没放我头像啊，这画的头像着实不错。 From YouMind

#头像 #YouMind #绘画 #AI #图像生成

6个月前

上个月，谷歌发布了 Nano Banana，自称“最先进的图像生成和编辑模型”。我试用后，感觉确实很强，而且免费使用。网友发现了这个模型的各种神奇用法，有人甚至收集成了一个 Awesome 仓库。我从这个仓库里面，挑了几个很实用的例子，分享给大家。

#谷歌 #nano banana #图像生成 #免费 #Awesome 仓库

6个月前

Nano Banana厉害之处和潜在问题速度：平均生成时间2-4秒，比如部分基准测试生成一张1024px仅2.3秒。一致性：多次编辑，角色准确率高达95% 竞争表现：LMArena盲测，胜率达到70%，GenEval分数为0.89。优于Flux Kontext（45%胜率）和DALL-E 3（0.76 GenEval）效率：用先进的Token压缩技术，将图像数据压缩至约1300个，这是低价（0.04美元一张）高速的原因。文本渲染：在图像正确渲染文本，行业领先。提示词保真与编辑：多步骤提示词表现卓越，能对现有图像编辑且无需遮罩。场景完整性、光照和构图等异常出色。 ## 潜在问题可靠性问题：模型有时会无法执行Prompt，而直接返回原图，某些情况下失败率接近50% 。伪影与质量下降：模型有时会引入一层“轻微的模糊层”，降低图像的清晰度。 AI生成的常见问题也都有：手部变形等。尤其当主体物不处于中心位置或背景复杂时，图像质量可能会下降。特定弱点：尽管整体真实感出色，但处理精细面部特征时，与Qwen等竞对比，稍显逊色。

#nano banana #图像生成 #AI模型 #潜在问题 #速度快

6个月前

SPRO：扩散模型优化腾讯混元开源的训练方法。能优化扩散模型生成图片的质量和偏好。优点是计算量小、训练速度快、没有过拟合的问题。项目地址： Github：

#SPRO #扩散模型 #腾讯混元 #开源 #图像生成