Classification and curation of Listening Experiences (Demo)

component-id

type

name

description

work-package

pilot

project

resource

release-date

release-number

release-link

doi

changelog

licence

copyright

contributors

related-components

credits

child-search-expansion

WebApplication

Classification and curation of Listening Experiences with LLMs (Demo)

This demo component was developed with the aim of supporting the identification of implicit themes (classification) and metadata (curation) in text. It takes as reference the documentary evidence benchmark

WP4

CHILD

polifonia-project

https://github.com/polifonia-project/child-search-expansions/

05/09/2023

v1.0

https://github.com/polifonia-project/child-search-expansions/releases/tag/v0.1

https://zenodo.org/badge/latestdoi/588597123

https://github.com/polifonia-project/child-search-expansions/releases/tag/v0.1

Apache-2.0

Jason Carvalho <https://github.com/JaseMK>

Alba Morales Tirado <https://github.com/albamoralest>

Enrico Daga <https://github.com/enridaga>

informed-by
documentary-evidence-benchmark

https://github.com/JaseMK

https://github.com/albamoralest

https://github.com/enridaga

Classification and curation of Listening Experiences (Demo)

This small study, undertaken as part of the wider CHILD pilot, focuses on harnessing LLM technology to classify existing text extracts within LED, a task traditionally performed by human domain experts, to address the challenges posed by the volume of textual data in fields such as music history. Our experiment evaluates the effectiveness of an LLM in categorizing text extracts under the specific theme of childhood, comparing its performance with that of a human domain expert. The comparison aims to quantify the alignment between machine and human interpretations in textual analysis, look at areas where LLM technology may show weaknesses and also investigate if there areas where LLMs are able to shed new light on data that may go unnoticed by humans.

The software included here was developed with the aim of supporting the identification of implicit themes in text and takes as reference the documentary evidence benchmark.

Interactions with the ChatGPT API (or other LLM) is currently handled in the chatgpt.py file. Interactions with the LED knowledge graph are handled in led.py. In order to run any of the scripts in this distribution, a copy of config.py.dist must be made, called config.py, in which a valid OpenAI API key should be specified.

A summary of the experiements performed is provided in 'output/CHILD_text_classification_with_LLM.pdf'

Results and analysis are provided in 'output/ChatGPT-CHILD-Analysis.xlsx'

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
data		data
output		output
templates		templates
website		website
.gitignore		.gitignore
README.md		README.md
analyseBenchmarks.py		analyseBenchmarks.py
buildBenchmarkJson.py		buildBenchmarkJson.py
chatgpt-old.py		chatgpt-old.py
chatgpt.py		chatgpt.py
concat_no_dup.py		concat_no_dup.py
config.py.dist		config.py.dist
extractCsvCol.py		extractCsvCol.py
led.py		led.py
main.py		main.py
precisionRecall.py		precisionRecall.py
runTest.py		runTest.py
sample-results.txt		sample-results.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Classification and curation of Listening Experiences (Demo)

About

Releases 3

Packages

Contributors 3

Languages

polifonia-project/child-search-expansions

Folders and files

Latest commit

History

Repository files navigation

Classification and curation of Listening Experiences (Demo)

About

Resources

Stars

Watchers

Forks

Releases 3

Packages 0

Contributors 3

Languages

Packages