Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

πŸš€ Roadmap πŸš€ #2

Open
5 of 9 tasks
PeriniM opened this issue Sep 9, 2024 · 0 comments
Open
5 of 9 tasks

πŸš€ Roadmap πŸš€ #2

PeriniM opened this issue Sep 9, 2024 · 0 comments

Comments

@PeriniM
Copy link
Contributor

PeriniM commented Sep 9, 2024

We are considering adding the following features to enhance the functionality and flexibility of the library:

βœ… TODOs:

  • πŸ“ Update with New Document

    • Goal: When a new or updated document is loaded, the library should recognize entities and relationships that are already present in the existing schema and integrate them with the new information. Functions for matching and identifying overlaps between the current schema and the one extracted from the new document should be implemented.
    • Possible Implementation:
      • Analyze the nodes (entities) and edges (relationships) of the current schema.
      • Deduplication algorithms (e.g., comparing identifiers or entity names).
      • Reconciliation between new entities/relationships and pre-existing ones.
  • πŸ“Š Update with Information from Domain Experts

    • Goal: Allow users to manually add specific information to the model, improving output quality with their expertise or adding information from non-structured sources (e.g., images or notes).
    • Possible Implementation:
      • Support input of textual data or other sources (such as images, charts).
      • If the added information is textual, use LLM to parse and integrate the information into the current graph or schema.
      • If the input is an image or other file types, support OCR or visual analysis to extract entities and relationships.
  • πŸ”€ Merge Two Schemas

    • Goal: Merge two schemas from different documents or sources.
    • Possible Implementation:
      • Before merging, establish whether the two schemas refer to the same domain or overlapping topics.
      • Implement algorithms to compare entities in the two schemas and find similarities (e.g., using similarity metrics on entity and relationship names).
      • Choose a merge strategy to resolve conflicts (e.g., handling duplications or inconsistencies).
  • πŸ—‘οΈ Delete Functionality (on Schema, Entities, or Relationships)

    • Goal: Allow users to modify the schema by removing entities or relationships that are no longer relevant.
    • Possible Implementation:
      • Create a simple interface or API where the user can specify the entity or relationship to delete (e.g., via ID or description).
      • Provide users the ability to describe what they want to remove in natural language and use LLM to interpret and apply the request.
  • πŸ” Conditional Generation Based on User Prompt

    • Goal: Allow users to specify exactly which parts of the document or schema they want to extract and represent.
    • Possible Implementation:
      • Users provide a specific prompt (e.g., β€œextract only financial-related entities”).
      • The LLM interprets the prompt and filters the output, keeping only what is relevant to the requested context.
  • πŸ”„ Iterative Refinement of Schema Generation

    • Goal: Improve the schema progressively through an iterative process, where each subsequent step is validated and optimized based on feedback.
    • Possible Implementation:
      • After generating a schema, the system could re-check with the LLM for accuracy or ask for specific improvements, such as adding missing details or removing redundant information.
      • Each cycle could include conditional queries, for example: "Do you see any inconsistencies in this schema?"
  • πŸ“œ Change Tracking:

    • Goal: Implement a versioning system that tracks every modification made to the schema so that users can see how it has been updated over time.
    • Possible Implementation:
      • Maintain a log of updates, showing changes in entities and relationships with timestamps and user inputs.
  • πŸ–ΌοΈ Interactive Schema Visualization:

    • Goal: Add a graphical visualization feature that shows entities and relationships interactively and in an updatable manner.
    • Possible Implementation:
      • Use libraries like pyecharts for basic visualization, with an interactive layer allowing users to explore the schema.
  • πŸ”— Integration with External Knowledge Bases:

    • Goal: The system should suggest missing relationships or additional information based on data from external knowledge bases such as Wikidata.
    • Possible Implementation:
      • Use APIs to query external knowledge bases and cross-reference data from the extracted schema, enriching entities and relationships.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant