Ovis2 Image-to-Prompt Tool 3.0: batch-generate prompts from images and videos with large AI models, runs comfortably on an NVIDIA GPU with 6 GB of VRAM

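This tool is built on two large models, Ovis2-4B and Ovis2-2B.

Some time ago I shared an image-to-prompt tool based on moondream. Yesterday I came across this model and found it more capable: the prompts it produces are richer and describe more detail. The hardware requirement is modest; an NVIDIA GPU with 6 GB of VRAM or more is enough to run it comfortably.

I generated prompts from the same image with each model to compare them.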

8DA9BBE9A17A482917B462B52B91FAC8.png

Below is the prompt generated by Ovis2-4B:

The image depicts a young woman standing in a snowy forest. She is wearing a black puffer jacket over a black bikini, creating a striking contrast against the white snow. The woman has long, wavy blonde hair that falls past her shoulders. Her facial expression is neutral, with a slight hint of a smile, and her eyes are looking directly at the camera.

Her posture is relaxed, with her hands casually placed in the pockets of her jacket. The background features snow-covered trees, creating a serene and wintry atmosphere. The lighting appears soft and diffused, typical of an overcast winter day, which enhances the cold and tranquil mood of the scene.

The woman's body shape is slender, and her skin tone is fair. The overall composition of the image, with the contrast between her dark clothing and the white snow, draws attention to her figure and the juxtaposition of her attire against the winter setting. The image captures a moment of contrast between the warmth of her clothing and the cold environment, creating a visually striking and thought-provoking scene.


Below is the prompt generated by Ovis2-2B:

The image depicts a woman standing in a snowy forest, creating a striking contrast between her attire and the winter setting. She is wearing a black puffer jacket with the hood up, which is open, revealing her black bikini top and black underwear with crisscross straps. Her long, blonde hair cascades down her shoulders, and she has a neutral expression on her face.

The background is a serene winter scene with snow-covered trees, adding to the cold, wintry atmosphere. The lighting is soft and diffused, likely due to overcast skies, which enhances the overall mood of the image.

The woman's posture is relaxed yet confident, with her body slightly turned to the side. Her skin tone is fair, and her body shape appears to be slender. The combination of her black clothing and the snowy environment creates a visually striking contrast, emphasizing her figure and the stark beauty of the winter landscape.


Below is the prompt generated by moondream:

The image depicts a young woman standing in a snowy forest. She is wearing a black puffy coat and black bikini bottoms. Her blonde hair is styled in loose waves, and she has a neutral expression on her face. The background is filled with snow-covered trees, creating a serene and tranquil atmosphere. The lighting is natural, with sunlight filtering through the trees, casting a soft glow on the scene. The woman's posture is relaxed, with her body angled slightly to the left, and her gaze is directed straight at the camera. The contrast between her black clothing and the white snow adds a dramatic effect to the image. The woman's body shape is slender, and her chest circumference is visible, highlighting her figure.

The gap is noticeable, so I took some time to build this tool.

2025-03-09_09-53-51.png

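The difference between Ovis2-4B and Ovis2-2B: 4B produces prompts with more detail but runs somewhat slower, while 2B is faster but gives less detail. Personally I think 2B is good enough, but 4B may feel more reassuring, so I kept it as well.

The reason 2B is attractive is that with a large number of images, the difference in total processing time becomes substantial.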
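To see how the speed difference adds up over a large batch, here is a small back-of-the-envelope sketch. The per-image timings are hypothetical placeholders, not measurements of this tool; the post gives no figures, so time a few images on your own GPU and substitute the real numbers.

```python
def estimate_hours(num_images: int, seconds_per_image: float) -> float:
    """Total wall-clock hours for a batch processed one image at a time."""
    return num_images * seconds_per_image / 3600


# Hypothetical per-image timings (assumptions for illustration only).
HYPOTHETICAL_SECONDS = {"Ovis2-2B": 4.0, "Ovis2-4B": 8.0}

for name, secs in HYPOTHETICAL_SECONDS.items():
    print(f"{name}: {estimate_hours(10_000, secs):.1f} h for 10,000 images")
```

With these assumed timings, a few seconds of difference per image turns into many hours over ten thousand images, which is why the faster model can be the practical choice for bulk runs.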

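A quick note on usage: put the images you want to process in the input folder, double-click 启动批量反推图片.exe (the batch image-to-prompt launcher), and choose model 1 or 2.

Wait for the run to finish. The generated prompt files are saved in the txt folder.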
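The workflow above (read images from input, write one prompt file per image into txt) can be sketched in Python as follows. This is not the tool's actual source, which is distributed as an exe. The function name `batch_caption`, the `caption_fn` callback, and the list of image extensions are all assumptions made for illustration. `caption_fn` stands in for whatever model call produces the prompt.

```python
from pathlib import Path
from typing import Callable, List

# Assumed set of image extensions; the real tool may accept a different set.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp", ".bmp"}


def batch_caption(input_dir, output_dir, caption_fn: Callable[[Path], str]) -> List[Path]:
    """Write one <image name>.txt prompt file per image found in input_dir."""
    in_path = Path(input_dir)
    out_path = Path(output_dir)
    if not in_path.is_dir():
        return []  # nothing to process
    out_path.mkdir(parents=True, exist_ok=True)

    written = []
    for image in sorted(in_path.iterdir()):
        if not image.is_file() or image.suffix.lower() not in IMAGE_SUFFIXES:
            continue  # skip sub-folders and non-image files
        prompt = caption_fn(image)  # hypothetical hook for the model call
        target = out_path / (image.stem + ".txt")  # same base name as the image
        target.write_text(prompt, encoding="utf-8")
        written.append(target)
    return written
```

Calling `batch_caption("input", "txt", my_caption_fn)` with your own `my_caption_fn` would mirror the folder layout described above. Naming each output after the image stem keeps prompts and images easy to pair up.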

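Once you have a large batch of prompts, you can feed them to the ModelScope batch AI text-to-image tool 27pic-api v3.0 (no GPU required, no setup, just unzip and run) and leave it running unattended to generate images in bulk.

Below are prompts generated from videos: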
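Before handing the prompts to a text-to-image tool, it can help to gather them into a single list. The sketch below assumes the downstream tool wants one prompt per line; the post does not describe 27pic-api's input format, so treat that as an assumption. The helper name `collect_prompts` is hypothetical.

```python
from pathlib import Path
from typing import List


def collect_prompts(txt_dir) -> List[str]:
    """Return one single-line prompt per .txt file, ordered by file name."""
    prompts = []
    for txt_file in sorted(Path(txt_dir).glob("*.txt")):
        text = txt_file.read_text(encoding="utf-8")
        # Collapse paragraph breaks and repeated spaces into single spaces.
        one_line = " ".join(text.split())
        if one_line:  # ignore empty files
            prompts.append(one_line)
    return prompts
```

The Ovis2 prompts shown earlier span several paragraphs, so flattening each one to a single line keeps a one-prompt-per-line file unambiguous.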

1 The video features a woman dressed in a traditional Chinese qipao, a dress with a high collar and a floral pattern, paired with a light yellow blazer. The woman is seen in various poses, with her hair tied back in a ponytail, and she wears long, dangling earrings. The background is a soft pink color with a framed picture visible. Throughout the video, the woman's expressions change subtly, with her eyes looking off to the side and her mouth slightly open, suggesting a range of emotions.

2 The video features a woman in a light blue bikini standing in a hot spring, holding a glass jar. She is surrounded by a bamboo fence and stone edges, creating a serene and natural setting. The woman is seen adjusting her bikini and holding the jar, with water splashing around her. She then holds the jar closer to the camera, showcasing the water inside. The video captures her in various poses, emphasizing the tranquil atmosphere of the hot spring.

3 The video features a woman standing in front of a wooden door, wearing a blue, form-fitting dress with a unique design, one shoulder strap, and a tied belt at the waist. She has long, dark hair and is accessorized with long, matching gloves on one arm. The background is a neutral-colored wall, and the lighting is warm, creating a cozy atmosphere. The woman strikes various poses, showcasing her dress and accessories. Text appears in the top left corner, indicating the source of the video, while the bottom right corner displays the Douyin (TikTok) username '抖音号: 4013595' and the search term '乔乔不熬夜' (Jiao Jiao doesn't stay up late). The video concludes with a screen showing the Douyin profile of the user with the username '抖音号: 4013595' and a search bar with the text '抖音搜索 乔乔不熬夜' (Search on Douyin: Jiao Jiao doesn't stay up late). 

4 The video features a woman performing a series of acrobatic movements on a sports field, dressed in a white and light blue crop top and light gray pants. The background includes a building with orange and white walls, pink banners with white Chinese characters, and a few people sitting and walking around. The woman starts by standing and then begins to move, performing a cartwheel and a backflip. She continues with a handstand and a backbend, showcasing her flexibility and balance. The scene is set in a sunny outdoor environment with green grass and trees in the background. The video concludes with the woman lying on the grass, relaxing and smiling, with a cool emoji appearing above her. The text '王六堡' (Wang Ziqi) and '知乎足' (ZhiQie Foot) appear in the top right corner, indicating the woman's social media handle and the context of the video.

2025-03-09_11-02-46.png

2025-03-09_19-31-15.png Video demo

If you don't know where to find a large number of images to process, try the following:

https://www.myhelen.cn/pic/

Instructions for the all-in-one package (must read):

https://www.myhelen.cn/helen/267.htm

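3.0 changelog

1 Some users hit errors at startup, so a batch file that reinstalls the environment has been added; run it once if startup fails.

2 The txt file saved by the image tool now has the same name as the image.

3 Fixed some of the run logic; it should be a little faster.

4 Cleaned out some cache and junk files, so the archive is somewhat smaller.

5 If errors persist, see the instructions linked above.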

17 comments so far

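  1. 大气的酒窝

    On Windows 10, Ovis2 runs normally but produces no inference results.

  2. 大气的酒窝

    On Windows 10 it runs normally, but the txt folder contains no files.
    The same happens with both model 1 and model 2.

    1. 剑心

      Take a look at the instructions.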

  3. 机器猫满意

    0. AIDC-AI/Ovis2-1B (fast)
    1. AIDC-AI/Ovis2-2B (fast)
    2. AIDC-AI/Ovis2-4B (better results)
    Enter model number (default 2): 2
    Downloading Model to directory: models\models\AIDC-AI\Ovis2-4B
    2025-03-25 23:21:21,034 - modelscope - WARNING - Using branch: master as version is unstable, use with caution
    Loading checkpoint shards: 0%| | 0/2 [00:00

    1. 机器猫满意

      Program failed, error code: 3221225477
      Press Enter to continue...
      I'm on Windows 11, and the path contains no Chinese characters.

      1. 剑心

        It does not run on Windows 11; you will have to work that out yourself.

    2. 机器猫满意

      Enter model number (default 2): 2
      Downloading Model to directory: models\models\AIDC-AI\Ovis2-4B
      2025-03-25 23:21:21,034 - modelscope - WARNING - Using branch: master as version is unstable, use with caution
      Loading checkpoint shards: 0%| | 0/2 [00:00

      1. 剑心

        Read the instructions at the end of the post carefully.

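  4. dngyng

    A suggestion: could the TXT file generated for each image be named after the original image? With many images it is impossible to tell which file belongs to which.

    1. 剑心

      Noted. I'll change it sometime when the mood strikes.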

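  5. 焕杭

    When generating prompts from images I get:
    Program failed, error code: 3221225477

    1. 剑心

      Which operating system?

      1. 焕杭

        Windows 10. The first run succeeded, but after that it stopped working. I just tested again and got a different error, this time without an error code.

        1. 剑心

          Make sure the path contains no Chinese or other non-ASCII characters.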

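  6. 诺言风趣

    Does this tool only work on images that contain people, or is the prompt you supply simply written with people in mind? I found that for any image without a person, the generated prompt starts with "I'm sorry, but...", yet the text after the apology still describes the image content. I'm not sure whether I'm doing something wrong.

    1. 剑心

      Any image can be used.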

  7. 伶俐闻手套

    666
