2024-10-25-pinterest收藏图片批量url复制到本地提交midjourney文字解析再生图

10月 25 2024 日记 12 分钟读完 (约 1864 字)

第一个景观项目目前推进到了还差树木、花卉，我觉得这需要大量的图例来参考学习，找感觉。光靠文字解说局限性大，且见效慢。我想到的方法是在pinterest上找图丢给midjourney参考生产，一种是带url参考一种是describe获取关键词之后用关键词生产，后者的话自由度更高。我目前设计的程序是每次采用第一组–ar前的关键词去生产图，还没有结合图片url做参考，或许也可以考虑把第二个–ar和第一个–ar之间的内容分两次，那么一个链接理论上如果配套url可以生产3组。配套url我觉得关键词结合一次就好了，图片引导的作用会比较强。

做这几个程序时还是要多依靠chatgpt往xpath方向走，结合一些特殊属性，text()来定位元素。drissionpage本身只支持class和id，其他那些定位什么多属性复合就别去研究了都交给chatgpt出xpath。class属性其实很不稳，经常会有元素共用导致定位不准确。

不用xpath定位我企图通过元素.ele()定位子元素失败了，反过来找父元素居然是通过层级1层1层试出来的，drissionpage貌似不能自己定位多重层级，这用tab.ele结合xpath根本就没这事儿。

pinterest没有设定默认图板的功能，因此我开了一个账号，专门用来搞景观、建筑参考，不设置任何图板，那么保存只能在默认的“个人资料”里堆积，一次量到位后，我在这个链接下操作批量删除所有的pin图即可，pinterest同样没有批量删除pin图便捷全选的功能。这个程序折腾了我大半天，一方面是受困于xpath，一方面是第1轮循环后它总是报原来获取的元素不在页面内，没有大小尺寸什么的，然后程序中断退出，完全就是无解了。后来我想到的办法是重复执行这个程序，抛出异常不让它中断即可，最后也解决了，不过开始几次测试有问题，问题出在sleep的时长，pinterest的响应速度比较老年痴呆，加载页面很慢，这个也不算占用太久，那就每1轮删除多一些等待时间吧。

pinterest批量url复制

  

from DrissionPage import ChromiumPage, ChromiumOptions
from DrissionPage.common import By
from DrissionPage.common import Keys
import time
import os
import sys
import re


do1 = ChromiumOptions().set_paths(local_port=9111, user_data_path=r'C:/Users/A/AppData/Local/Google/Chrome/User Data')
p = ChromiumPage(addr_or_opts=do1)
tab = p.new_tab() 
tab.get('https://www.pinterest.com/tomyu2717/_pins/')
time.sleep(6)

# 找到包含div class="vbI XiG"的元素
container_div = tab.ele('.:vbI')

# 提取所有 div class="Yl- MIw Hb7" 下的 img 标签
img_divs = container_div.eles('.:XiG zI7 iyn Hsu')

# 打开文件写入结果
with open('pinterest图片链接.txt', 'w') as file:
    for img in img_divs: 
        img = img.ele('tag:img')
        src_link = img.attr('src')
        # 使用正则表达式匹配 https://i.pinimg.com/ 后第一个 / 之间的部分并替换为 originals
        modified_link = re.sub(r'(https://i\.pinimg\.com/)[^/]+', r'originals', src_link)
        modified_link = "https://i.pinimg.com/" + modified_link 
        # 写入txt文件
        file.write(modified_link + '\n')

print("图片链接已成功提取并保存到output.txt。")

将pinterest图片url批量提交给midjourney描述并通过文字生图

  

from DrissionPage import ChromiumPage, ChromiumOptions
from DrissionPage.common import By
from DrissionPage.common import Keys
import time
import os
from bs4 import BeautifulSoup
import re

input_text1 = "/des"
input_text2 = "cribe"

input_imagine1 = "/im"
input_imagine2 = "agine"

do1 = ChromiumOptions().set_paths(local_port=9111, user_data_path=r'C:/Users/A/AppData/Local/Google/Chrome/User Data')
p = ChromiumPage(addr_or_opts=do1)
tab = p.new_tab()
tab.get('https://discord-d-com-s-mj3.aiwentu.net/channels/1296492298355478559/1297027207134445630')
time.sleep(6)

# 读取待提交的文本文件
with open('pinterest图片链接.txt', 'r') as file:
    lines = file.readlines()

# 最大重试次数
max_retries = 25

# 循环遍历每一行内容
for line in lines:
    imageurl = line.strip()  # 去除行末的换行符
    if imageurl:  # 确保非空
        retries = 0
        while retries < max_retries:
            try:
                # 找到并填写表单中的输入框（根据具体情况选择定位方式）
                time.sleep(2)
                shurukuang = (By.XPATH, "//div[@contenteditable='true' and @data-slate-editor='true' and @data-slate-node='value']")
                input1 = tab.ele(shurukuang)
                input1.click()
                input1.clear()
                input1.input(input_text1)  # 输入字符串
                time.sleep(1)
                input1.input(input_text2)
                time.sleep(16)
                input1.input(Keys.ENTER)  # 提交

                time.sleep(5)
                menu1 = (By.XPATH, "//div[@class='base_bcc24e']//div[@class='text-md/normal_dc00ef autocompleteRowHeading_bcc24e' and text()='link']")
                linkmenu = tab.ele(menu1)
                linkmenu.click()
                time.sleep(5)

                shurukuang2 = (By.XPATH, "//span[@class='optionPillValue_d4df8b']")
                input2 = tab.ele(shurukuang2)
                time.sleep(1)
                input2.click()
                time.sleep(1)
                input2.click()

                input2.input(imageurl)
                time.sleep(3)
                input2.input(Keys.ENTER)  # 提交
                time.sleep(8)
                tab.refresh()
                time.sleep(8)

                try:
                    # 找到所有的 grid_b0068a 元素
                    for original_link in tab.eles('@class=originalLink_d4597d'):
                        # 检查 href 是否匹配
                        if original_link.attr('href') == imageurl:
                            description_div = original_link.parent(4)
                            embed_description_html = description_div.ele('.:embedDescription_b0068a').html
                            soup = BeautifulSoup(embed_description_html, 'html.parser')
                            description_parts = [span.get_text(strip=True) for span in soup.find_all('span') if span.get_text(strip=True)]

                            # 将所有部分连接起来，确保格式正确
                            full_description = ' '.join(description_parts)
                            full_description = re.sub(r'\s+-\s*', '-', full_description)
                            ar_index = full_description.find('--ar')
                            if ar_index != -1:
                                full_description = full_description[:ar_index].strip()

                            print(full_description)
                            time.sleep(5)
                            shurukuang = (By.XPATH, "//div[@contenteditable='true' and @data-slate-editor='true' and @data-slate-node='value']")
                            input1 = tab.ele(shurukuang)
                            input1.click()
                            input1.clear()
                            input1.input(input_imagine1)  # 输入字符串
                            time.sleep(1)
                            input1.input(input_imagine2)
                            time.sleep(8)
                            input1.input(Keys.ENTER)  # 提交

                            shurukuang2 = (By.XPATH, "//span[@class='optionPillValue_d4df8b']")
                            input2 = tab.ele(shurukuang2)
                            time.sleep(1)
                            input2.click()
                            time.sleep(1)
                            input2.click()
                            input2.input(full_description)
                            time.sleep(3)
                            input2.input(Keys.ENTER)  # 提交
                            time.sleep(250)
                            tab.refresh()
                            time.sleep(55)
                            break  # 如果找到了匹配的 href，可以跳出循环

                except Exception as e:
                    print(f"错误: {e}")
                break  # 成功完成后跳出重试循环
            except Exception as e:
                retries += 1
                print(f"错误: {e}. 重试 ({retries}/{max_retries})...")
                time.sleep(5)

# 关闭浏览器
tab.driver.quit()

批量将pinterest的pin图删除

  

from DrissionPage import ChromiumPage, ChromiumOptions
from DrissionPage.common import By
from DrissionPage.common import Keys
import time
import os
import sys

do1 = ChromiumOptions().set_paths(local_port=9111, user_data_path=r'C:/Users/A/AppData/Local/Google/Chrome/User Data')
tab = ChromiumPage(addr_or_opts=do1)
tab.get('https://www.pinterest.com/tomyu2717/_pins/')
time.sleep(5)

while True:
# 找到包含div class="vbI XiG"的元素
    container_div = tab.ele('.:vbI')

    # 提取所有 div class="Yl- MIw Hb7" 下的 img 标签
    items = container_div.eles('.:Yl- MIw Hb7')

    if not items:
        print("没有找到要删除的元素，程序结束。")
        break

    # 遍历所有找到的元素
    for item in items:
        try:
            tab.actions.move_to(item)
            time.sleep(1)
            item.click()
            time.sleep(12)
            button1 = (By.XPATH, '//button[@aria-label="更多选项"]')
            more_button = tab.ele(button1)
            more_button.click()
            time.sleep(6)
            button2 = (By.XPATH, "//span[contains(@class, 'X8m') and text()='编辑 Pin 图']")
            edit_button = tab.ele(button2)
            edit_button.click()
            time.sleep(6)
            tanchuang1 = tab.ele('.:ZHw XiG XbT _O1 ho- rDA jar CCY')
            button3 = (By.XPATH, "//div[contains(@class, 'RCK') and .//div[text()='删除']]")
            confirm_button1 = tanchuang1.ele(button3)
            tab.actions.move_to(confirm_button1)
            confirm_button1.click()
            time.sleep(2)
            tanchuang2 = tab.ele('.:ZHw XiG XbT _O1 ho- rDA jar CCY')
            confirm_button2 = tanchuang2.ele('.:B1n tg7 tBJ dyH iFc sAJ H2s')
            tab.actions.move_to(confirm_button2)
            confirm_button2.click()
            time.sleep(6)
            pinmenu = (By.XPATH, "//div[contains(@class, 'DUt') and contains(@class, 'XiG')]//div[contains(@class, 'X8m') and text()='Pin 图']")
            pinmenu_button = tab.ele(pinmenu)
            pinmenu_button.click()
            time.sleep(5)
            tab.refresh()
            time.sleep(5)
        except Exception as e:
            print(f"错误: {e}")
            # 继续下一个元素
            continue

        print("一次批量删除已完成。")
        time.sleep(1)  # 每次执行完后等待5秒

print("批量删除已完成。")

#midjourney #景观 #ai人工智能作图 #python

2024-10-25-pinterest收藏图片批量url复制到本地提交midjourney文字解析再生图

评论

Your browser is out-of-date!