Fund: [BUG] When `file_type` is set to `all_files`, only `pdfminer` is used

Describe the bug
Hello,

It seems that when the parser module is configured with file_type: all_files, only pdfminer is applied. I have tried using langchain_parser/upstagedocumentparse and llamaparser, and both appear to use pdfminer exclusively. Even when I set the output_format to html, it seems like pdfminer is still being used. Am I mistaken about something?

Below is the YAML file I configured:

- module_type: langchain_parse
  parse_method: upstagedocumentparse
  split: page
  file_type: all_files
  output_format: html

- module_type: llamaparse
  result_type: markdown
  file_type: all_files
  language: ko

And here is the result:

I would appreciate your help. Thank you.

Markr.AI/AutoRAG

[BUG] When file_type is set to all_files, only pdfminer is used

How does funding with Polar work?

Backer

Contributor

Maintainer

Markr.AI/AutoRAG

[BUG] When file_type is set to all_files, only pdfminer is used

How does funding with Polar work?

Backer

Why does "Fund on completion" require GitHub login?

When is the invoice due for "Fund on completion"?

What happens if the issue is never completed?

Do I get any extra benefits by funding?

Do I get progress updates?

Contributor

Do I get a reward?

Is rewards guaranteed?

Maintainer

How can I get funding like this for my open source initiatives?