AI/ML - text summarization #1

Open
zoometh opened this issue May 22, 2024 · 0 comments

I have maintained two main DOCX files (Sites.docx and Cultures.docx) for years, each about 1,500 pages long. They are written mostly in French and cover European pre- and protohistory, describing archaeological sites (e.g., Stonehenge) and archaeological cultures or periods (e.g., Neolithic). The content includes tables, cross-references (hyperlinks and anchors between Sites.docx and Cultures.docx), bibliographic reference keys, and idiosyncratic notations (e.g., EN for Early Neolithic, BIB followed by a number for a bibliographic reference). The files are also organized by titles and subtitles that define:

  • temporal groupings (e.g., Neolithic covering Early, Middle, and Late Neolithic);

  • spatial groupings (e.g., Early Neolithic Atlantic, Early Neolithic West Mediterranean);

  • typological groupings (e.g., megalithism covering dolmen, menhir, and tumuli; weapons covering axes, spears, swords, etc.).
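To make the heading hierarchy described above concrete, here is a minimal sketch of how the titles and subtitles of a DOCX file could be turned into (heading path, text) pairs using only the Python standard library. It assumes the headings use Word's built-in heading styles and that their style IDs look like `Heading1` or, in a French installation, `Titre1`; the function name `iter_sections` and the regular expression are illustrative, not part of any existing tool. Table cells are read as plain paragraphs, and hyperlinks, anchors, and images are ignored.

```python
import re
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used inside word/document.xml
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"

# Assumption: heading style IDs are "Heading1".."Heading9" or "Titre1".."Titre9"
HEADING_RE = re.compile(r"^(?:Heading|Titre)(\d)$")


def iter_sections(docx_file):
    """Yield (heading_path, text) pairs, one per run of body paragraphs.

    docx_file is a path or a binary file-like object.
    """
    with zipfile.ZipFile(docx_file) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))

    path, body = [], []
    for para in root.iter(W + "p"):
        text = "".join(t.text or "" for t in para.iter(W + "t"))
        style = para.find(f"{W}pPr/{W}pStyle")
        style_id = style.get(W + "val", "") if style is not None else ""
        match = HEADING_RE.match(style_id)
        if match:
            # A new heading closes the section collected so far
            if body:
                yield tuple(path), "\n".join(body)
                body = []
            level = int(match.group(1))
            # Keep the ancestors above this level, then append the new title
            path = path[:level - 1] + [text]
        elif text.strip():
            body.append(text)
    if body:
        yield tuple(path), "\n".join(body)
```

Calling `list(iter_sections("Cultures.docx"))` would give one entry per section, each tagged with its full chain of titles (for example temporal, then spatial, then typological grouping), so the files themselves can stay in Word.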


I am looking for prompt-driven extractive summarization that can handle the intricate structure of these archives. For instance, after feeding the LLM these two documents, I would like to submit requests such as:

Summarize the different chapters on funerary practices during the Early Neolithic and list the sites with skeletal remains.

What would be the best way to keep maintaining these archives in a flexible word processor (e.g., Microsoft Word) while still being able to leverage AI/ML extractive text summarization for knowledge discovery?
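One possible direction, sketched here under stated assumptions rather than offered as a recommendation: keep editing in Word, periodically split the documents into (heading path, text) pairs, and pass only the relevant sections to the LLM. The helper names below (`expand_notations`, `select_sections`, `build_prompt`) and the glossary are hypothetical; only the `EN` abbreviation comes from the description above. `BIB` keys are left untouched so that answers stay traceable to the bibliography. No particular LLM API is assumed; the sketch only assembles the prompt text.

```python
import re

# Hypothetical glossary; only "EN" is taken from the description above
GLOSSARY = {"EN": "Early Neolithic"}


def expand_notations(text, glossary=GLOSSARY):
    """Replace whole-word, case-sensitive abbreviations with their full form."""
    for abbr, full in glossary.items():
        text = re.sub(rf"\b{re.escape(abbr)}\b", full, text)
    return text


def select_sections(sections, *keywords):
    """Keep sections whose heading path mentions every keyword (case-insensitive)."""
    wanted = [k.lower() for k in keywords]
    return [
        (path, text)
        for path, text in sections
        if all(k in " / ".join(path).lower() for k in wanted)
    ]


def build_prompt(question, sections):
    """Assemble an extractive-style prompt from the selected sections."""
    blocks = [
        f"## {' > '.join(path)}\n{expand_notations(text)}"
        for path, text in sections
    ]
    return (
        "Answer using only the excerpts below, quoting sentences verbatim.\n\n"
        + "\n\n".join(blocks)
        + f"\n\nQuestion: {question}"
    )
```

For the request above, `select_sections(sections, "early neolithic", "funerary")` would narrow two 1,500-page files down to the matching chapters before anything is sent to a model. Matching on heading text is a deliberately simple choice; it only works if the titles are written consistently.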
