-
-
Notifications
You must be signed in to change notification settings - Fork 12.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
xlsx,pdf均会分块失败 #7060
Comments
Thank you for raising an issue. We will investigate into the matter and get back to you as soon as possible. |
📦 Deployment environmentDocker 📦 Deployment modeServer-side mode (lobe-chat-database mirror) 📌 Software version1.73.0 💻 System environmentOther Linux 🌐 BrowserEdge 🐛 Question descriptionxlsx, pdf will fail in chunking 📷 Reproduction stepsNo response 🚦 Expected resultsNo response 📝 Supplementary informationNo response |
正常的,不支持xlsx,pdf不记得了,文档里有写要集成Unstructured,不过我到现在还没找到咋集成 |
Normal, does not support xlsx, I don't remember pdf, I wrote in the document to integrate Unstructed, but I haven't found how to integrate |
感觉文件向量化处理这块,lobechat明显不如cherry studio。 |
It feels that lobechat is obviously not as good as cherry studio in file vector processing. |
PDF必须是做过OCR处理的才能分块成功,你可以试试先用ocrmypdf在本地处理一下再上传分块 |
PDF must be processed by OCR before blocking successfully. You can try to use ocrmypdf to process locally before uploading chunking. |
📦 部署环境
Docker
📦 部署模式
服务端模式(lobe-chat-database 镜像)
📌 软件版本
1.73.0
💻 系统环境
Other Linux
🌐 浏览器
Edge
🐛 问题描述
xlsx,pdf均会分块失败
📷 复现步骤
No response
🚦 期望结果
No response
📝 补充信息
No response
The text was updated successfully, but these errors were encountered: