The document extractor can't extract the text content from the Excel file. #15878

ywf · 2025-03-14T15:27:54Z

Self Checks

This is only for bug report, if you would like to ask a question, please head to Discussions.
I have searched for existing issues search for existing issues, including closed ones.
I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
[FOR CHINESE USERS] 请务必使用英文提交 Issue，否则会被关闭。谢谢！:）
Please do not modify this template :) and fill in all the required fields.

Dify version

1.0.0

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

I'm creating a workflow in Dify. I hope to use the document extractor to read the text content in an Excel file, but the "text" read out is empty.

pload "文档提取器读取excel失败.jpg" -->

✔️ Expected Behavior

The document extractor can be used to read the text content of an Excel file.

❌ Actual Behavior

No response

ywf · 2025-03-14T15:29:17Z

ywf · 2025-03-14T15:30:05Z

Text can be read from Word and TXT files. However, for Excel files, it doesn't work even when I try other files.

dosubot · 2025-03-14T15:34:48Z

I found a similar unsolved discussion titled File Extractor Import Excel Cannot Read is displayed as emptyHelp which might be related to your issue ^[1]. Additionally, there was a closed issue titled The LLM does not get the contents of the document extractor correctly, which was resolved by a pull request that fixed Unicode escape characters ^[2].

For your specific problem, ensure that the necessary dependencies for the ExcelExtractor are installed and updated. These include pandas, openpyxl, and xlrd ^[3]. If these are already in place, check if there are any configuration settings or conditions in your workflow that might be preventing the ExcelExtractor from executing, such as an empty variable_selector ^[4]. Additionally, review any logs for error messages that might provide insights into the failure ^[5].

To continue talking to Dosu, mention @dosu.

^{Help Dosu learn! Give it feedback: Great Response | Irrelevant Answer | Incorrect Sources | Too Verbose | Hallucination | Bug Report | Other}

dosubot bot added the 🐞 bug Something isn't working label Mar 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The document extractor can't extract the text content from the Excel file. #15878

The document extractor can't extract the text content from the Excel file. #15878

ywf commented Mar 14, 2025

ywf commented Mar 14, 2025

ywf commented Mar 14, 2025

dosubot bot commented Mar 14, 2025

The document extractor can't extract the text content from the Excel file. #15878

The document extractor can't extract the text content from the Excel file. #15878

Comments

ywf commented Mar 14, 2025

Self Checks

Dify version

Cloud or Self Hosted

Steps to reproduce

✔️ Expected Behavior

❌ Actual Behavior

ywf commented Mar 14, 2025

ywf commented Mar 14, 2025

dosubot bot commented Mar 14, 2025