Skip to content

UnstructuredDirectoryLoader uses ๐Ÿฆœ๏ธ๐Ÿ”— LangChain <langchain_community.document_loaders.unstructured> UnstructuredFileLoader to load files like '.txt', '.csv', '.pdf', '.msg' into a List[Document] using ๐Ÿฆœ๏ธ๐Ÿ”— LangChain <langchain_core.documents> Document

License

Notifications You must be signed in to change notification settings

priyankt3i/UnstructuredDirectoryLoader

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

11 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

What is UnstructuredDirectoryLoader?

  • UnstructuredDirectoryLoader uses ๐Ÿฆœ๏ธ๐Ÿ”— LangChain <langchain_community.document_loaders.unstructured> UnstructuredFileLoader to load files like '.txt', '.csv', '.pdf', '.msg' into a List[Document] using ๐Ÿฆœ๏ธ๐Ÿ”— LangChain <langchain_core.documents> Document

How to install

The Python version used when this was developed was 3.9 ๐Ÿ˜ฎ

pip install UnstructuredDirectoryLoader

How to use

  1. Just pass you local directory path in UnstructuredDirectoryLoader() and then use this directory loader.

    directory_loader = UnstructuredDirectoryLoader(directory_path='*<Your_Local_Folder_Path>*')
    
    listDocs = directory_loader.load()
    
  2. Done! Go ahead and use a text splitter on this docs

    text_splitter = SemanticChunker(HuggingFaceEmbeddings())
    

About

UnstructuredDirectoryLoader uses ๐Ÿฆœ๏ธ๐Ÿ”— LangChain <langchain_community.document_loaders.unstructured> UnstructuredFileLoader to load files like '.txt', '.csv', '.pdf', '.msg' into a List[Document] using ๐Ÿฆœ๏ธ๐Ÿ”— LangChain <langchain_core.documents> Document

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published