Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

G Retriever Experiments and Improvements (full) #9666

Draft
wants to merge 934 commits into
base: master
Choose a base branch
from

Conversation

zaristei
Copy link

Follow up to PR 9597. Includes multiple changes related to LLM+GNN experiments and scaling up to a remote backend. Including:

  • LargeGraphIndexer for building a large knowledge graph locally from multiple samples in an arbitrary dataset
  • Remote Backend Loader and examples for deploying a Retrieval algorithm to a third party backend FeatureStore or GraphStore
  • NVTX profiling tools for nsys users
  • Quality of Life improvements and benchmarking scripts for G-Retriever.

puririshi98 and others added 30 commits July 25, 2024 09:28
@zaristei
Copy link
Author

Aiming to resolve the merge conflicts by EoW. I can also break this PR down further if needed.

@puririshi98 puririshi98 self-requested a review September 16, 2024 23:03
zaristei and others added 3 commits September 26, 2024 17:16
# Conflicts:
#	CHANGELOG.md
#	test/nn/models/test_g_retriever.py
#	torch_geometric/datasets/__init__.py
#	torch_geometric/datasets/web_qsp_dataset.py
#	torch_geometric/nn/models/g_retriever.py
#	torch_geometric/nn/nlp/llm.py
#	torch_geometric/nn/nlp/sentence_transformer.py
@zaristei
Copy link
Author

zaristei commented Sep 27, 2024

Merge Conflicts are resolved. Currently running the following tests to make sure nothing has broken:

  • WebQSP Train Val Test Split
  • WebQSP Truncate Limit
  • GRetriever Example
  • GRetriever Unit tests
  • Remote Backend Script in Examples
  • Multihop Script in Examples
  • NVTX script in Examples

hallucinates_df.to_csv(root_dir + "/hallucinates_df.csv", index=False)


def minimal_demo(gnn_llm_eval_outs, dataset, lr, epochs, batch_size,
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@puririshi98 FYI: I re-added the minimal demo to this PR, but didn't include it in the critical path of running this file, not sure whether you were going to re-add this minimal demo in a separate PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants