Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]: Missing files for example multimodal_rag_langchain.ipynb #742

Open
1 task done
caetano-colin opened this issue Jun 3, 2024 · 0 comments
Open
1 task done
Assignees

Comments

@caetano-colin
Copy link

caetano-colin commented Jun 3, 2024

File Name

gemini/use-cases/retrieval-augmented-generation/multimodal_rag_langchain.ipynb

What happened?

I was running the example and after downloading the files from the bucket using: !gsutil -m rsync -r gs://github-repo/rag/intro_multimodal_rag/ .

When running the code below:

pdf_folder_path = "/content/data/" if "google.colab" in sys.modules else "data/"
pdf_file_name = "google-10k-sample-14pages.pdf"

# Extract images, tables, and chunk text from a PDF file.
raw_pdf_elements = partition_pdf(
    filename=pdf_file_name,
    extract_images_in_pdf=False,
    infer_table_structure=True,
    chunking_strategy="by_title",
    max_characters=4000,
    new_after_n_chars=3800,
    combine_text_under_n_chars=2000,
    image_output_dir_path=pdf_folder_path,
)

It is not able to find google-10k-sample-14pages.pdf file from the bucket downloaded files. I tried navigating through the github-repo bucket and was not able to find this pdf file.

Relevant log output

[Errno 2] No such file or directory: 'google-10k-sample-14pages.pdf'
PDF text extraction failed, skip text extraction...

Code of Conduct

  • I agree to follow this project's Code of Conduct
@caetano-colin caetano-colin changed the title [Bug]: Missing PDF files for example [Bug]: Missing PDF files for example multimodal_rag_langchain.ipynb Jun 3, 2024
@caetano-colin caetano-colin changed the title [Bug]: Missing PDF files for example multimodal_rag_langchain.ipynb [Bug]: Missing files for example multimodal_rag_langchain.ipynb Jun 3, 2024
@holtskinner holtskinner self-assigned this Jun 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants