The Graphlit Agent Tools for Python integrate the Graphlit service with agent frameworks such as CrewAI and Griptape, so developers can use Graphlit within agentic workflows. This document outlines the setup process and provides a basic example of using the tools.
Before you begin, ensure you have the following:
- Python 3.x installed on your system.
- An active account on the Graphlit Platform with access to the API settings dashboard.
To install the Graphlit Agent Tools with CrewAI, use pip:
pip install graphlit-tools[crewai]
To install the Graphlit Agent Tools with Griptape, use pip:
pip install graphlit-tools[griptape]
Example Google Colab notebooks using CrewAI are available: one analyzes the web marketing strategy of a company, and another performs structured data extraction of products from scraped web pages.
Once you have configured the Graphlit client, as shown below, you will pass the client to the tool constructor.
For use in CrewAI, you will need to convert the tool to the CrewAI tool schema with the CrewAIConverter.from_tool() function.
For use in Griptape, you will need to convert the tool to the Griptape tool schema with the GriptapeConverter.from_tool() function.
We will provide support for additional agent frameworks, such as LangGraph and AutoGen, in the future.
# CrewAI example
from crewai import Agent
from graphlit_tools import WebSearchTool, CrewAIConverter

# Wrap the Graphlit tool in the CrewAI tool schema; `graphlit` is the configured Graphlit client.
web_search_tool = CrewAIConverter.from_tool(WebSearchTool(graphlit))

web_search_agent = Agent(
    role="Web Researcher",
    goal="Find the {company} website.",
    backstory="",
    verbose=True,
    allow_delegation=False,
    tools=[web_search_tool],
)
# Griptape example
from griptape.structures import Agent
from graphlit_tools import WebSearchTool, GriptapeConverter

# Wrap the Graphlit tool in the Griptape tool schema; `graphlit` is the configured Graphlit client.
web_search_tool = GriptapeConverter.from_tool(WebSearchTool(graphlit))

# Griptape's Agent takes its tools directly; role, goal and backstory are CrewAI parameters.
web_search_agent = Agent(
    tools=[web_search_tool],
)
The Graphlit client reads the following environment variables for authentication and configuration:
- GRAPHLIT_ENVIRONMENT_ID: Your environment ID.
- GRAPHLIT_ORGANIZATION_ID: Your organization ID.
- GRAPHLIT_JWT_SECRET: Your JWT secret for signing the JWT token.
Alternatively, you can pass these values to the constructor of the Graphlit client.
You can find these values in the API settings dashboard on the Graphlit Platform.
For example, to use Graphlit in a Google Colab notebook, you need to assign these properties as Colab secrets: GRAPHLIT_ORGANIZATION_ID, GRAPHLIT_ENVIRONMENT_ID and GRAPHLIT_JWT_SECRET.
import os
from google.colab import userdata
from graphlit import Graphlit
os.environ['GRAPHLIT_ORGANIZATION_ID'] = userdata.get('GRAPHLIT_ORGANIZATION_ID')
os.environ['GRAPHLIT_ENVIRONMENT_ID'] = userdata.get('GRAPHLIT_ENVIRONMENT_ID')
os.environ['GRAPHLIT_JWT_SECRET'] = userdata.get('GRAPHLIT_JWT_SECRET')
graphlit = Graphlit()
To set these environment variables on your system, use the following commands, replacing each your_..._value placeholder with the actual value from your account.
For Unix/Linux/macOS:
export GRAPHLIT_ENVIRONMENT_ID=your_environment_id_value
export GRAPHLIT_ORGANIZATION_ID=your_organization_id_value
export GRAPHLIT_JWT_SECRET=your_secret_key_value
For Windows Command Prompt (CMD):
set GRAPHLIT_ENVIRONMENT_ID=your_environment_id_value
set GRAPHLIT_ORGANIZATION_ID=your_organization_id_value
set GRAPHLIT_JWT_SECRET=your_secret_key_value
For Windows PowerShell:
$env:GRAPHLIT_ENVIRONMENT_ID="your_environment_id_value"
$env:GRAPHLIT_ORGANIZATION_ID="your_organization_id_value"
$env:GRAPHLIT_JWT_SECRET="your_secret_key_value"
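
Before constructing the client, it can help to confirm that all three variables are actually visible to the Python process. The following is a minimal sketch using only the standard library; `missing_graphlit_variables` is a hypothetical helper name chosen for illustration and is not part of the Graphlit client.

```python
import os

# The three variable names documented above.
REQUIRED_VARIABLES = (
    "GRAPHLIT_ENVIRONMENT_ID",
    "GRAPHLIT_ORGANIZATION_ID",
    "GRAPHLIT_JWT_SECRET",
)


def missing_graphlit_variables(env=None):
    """Return the required variable names that are unset or empty.

    Hypothetical helper for illustration. Pass a dict to check something
    other than the current process environment.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARIABLES if not env.get(name)]


if __name__ == "__main__":
    missing = missing_graphlit_variables()
    if missing:
        print("Missing Graphlit settings: " + ", ".join(missing))
    else:
        print("All Graphlit settings are present.")
```

Running this in the same shell or notebook session where the variables were set shows whether they reached the process that will create the client.
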
Ingests content from URL. Returns extracted Markdown text and metadata from content. Can ingest individual Word documents, PDFs, audio recordings, videos, images, or any other unstructured data.
Name | Type | Description |
---|---|---|
url | str | URL of cloud-hosted file to be ingested into knowledge base |
Ingests content from local file. Returns extracted Markdown text and metadata from content. Can ingest individual Word documents, PDFs, audio recordings, videos, images, or any other unstructured data.
Name | Type | Description |
---|---|---|
file_path | str | Path of local file to be ingested into knowledge base |
Scrapes web page into knowledge base. Returns Markdown text and metadata extracted from web page.
Name | Type | Description |
---|---|---|
url | str | URL of web page to be scraped and ingested into knowledge base |
Crawls web pages from web site into knowledge base. Returns Markdown text and metadata extracted from web pages.
Name | Type | Description |
---|---|---|
url | str | URL of web site to be crawled and ingested into knowledge base |
search | Optional[str] | Text to search for within ingested web pages |
read_limit | Optional[int] | Maximum number of web pages from web site to be crawled |
Accepts search query text as string. Performs web search based on search query. Returns Markdown text and metadata extracted from web pages.
Name | Type | Description |
---|---|---|
search | str | Text to search for within web pages across the Internet |
search_limit | Optional[int] | Maximum number of web pages to be returned from web search |
Accepts web page URL as string. Enumerates the web pages at or beneath the provided URL using web sitemap. Returns list of mapped URIs from web site.
Name | Type | Description |
---|---|---|
url | str | URL of the web page to be mapped |
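
To make "at or beneath the provided URL" concrete, here is one plausible reading of that rule in standard-library Python. This is an assumption about the matching rule, not the tool's actual implementation, and `is_at_or_beneath` is a hypothetical helper.

```python
from urllib.parse import urlsplit


def is_at_or_beneath(base_url, candidate_url):
    """Return True if candidate_url is the base page or lives under its path.

    Assumed rule: same host, and the candidate path either equals the base
    path or continues it at a '/' boundary.
    """
    base = urlsplit(base_url)
    candidate = urlsplit(candidate_url)
    if base.netloc.lower() != candidate.netloc.lower():
        return False
    base_path = base.path.rstrip("/")
    candidate_path = candidate.path.rstrip("/")
    return candidate_path == base_path or candidate_path.startswith(base_path + "/")
```

Under this reading, https://example.com/docs/api is beneath https://example.com/docs, while https://example.com/docs-old is not, because the match has to end on a path segment boundary.
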
Ingests posts from Reddit subreddit into knowledge base. Returns extracted Markdown text and metadata from Reddit posts.
Name | Type | Description |
---|---|---|
subreddit_name | str | Reddit subreddit name to be read and ingested into knowledge base |
search | Optional[str] | Text to search for within ingested posts |
read_limit | Optional[int] | Maximum number of posts from Reddit subreddit to be read, defaults to 10 |
Ingests pages from Notion database into knowledge base. Returns extracted Markdown text and metadata from Notion pages.
Requires NOTION_API_KEY to be assigned as environment variable.
Name | Type | Description |
---|---|---|
search | Optional[str] | Text to search for within ingested pages |
read_limit | Optional[int] | Maximum number of pages from Notion database to be read, defaults to 10 |
Ingests posts from RSS feed into knowledge base. For podcast RSS feeds, audio will be transcribed and ingested into knowledge base. Returns extracted or transcribed Markdown text and metadata from RSS posts.
Name | Type | Description |
---|---|---|
url | str | RSS URL to be read and ingested into knowledge base |
search | Optional[str] | Text to search for within ingested posts and/or transcripts |
read_limit | Optional[int] | Maximum number of posts from RSS feed to be read, defaults to 10 |
Ingests emails from Microsoft Email account into knowledge base. Returns extracted Markdown text and metadata from emails.
Requires MICROSOFT_EMAIL_CLIENT_ID, MICROSOFT_EMAIL_CLIENT_SECRET and MICROSOFT_EMAIL_REFRESH_TOKEN to be assigned as environment variables.
Name | Type | Description |
---|---|---|
search | Optional[str] | Text to search for within ingested email |
read_limit | Optional[int] | Maximum number of emails from Microsoft Email account to be read, defaults to 10 |
Ingests emails from Google Email account into knowledge base. Returns extracted Markdown text and metadata from emails.
Requires GOOGLE_EMAIL_CLIENT_ID, GOOGLE_EMAIL_CLIENT_SECRET and GOOGLE_EMAIL_REFRESH_TOKEN to be assigned as environment variables.
Name | Type | Description |
---|---|---|
search | Optional[str] | Text to search for within ingested email |
read_limit | Optional[int] | Maximum number of emails from Google Email account to be read, defaults to 10 |
Ingests issues from GitHub repository into knowledge base. Accepts GitHub repository owner and repository name. For example, for GitHub repository (https://github.com/openai/tiktoken), 'openai' is the repository owner, and 'tiktoken' is the repository name. Returns extracted Markdown text and metadata from issues.
Requires GITHUB_PERSONAL_ACCESS_TOKEN to be assigned as environment variable.
Name | Type | Description |
---|---|---|
repository_name | str | GitHub repository name |
repository_owner | str | GitHub repository owner |
search | Optional[str] | Text to search for within ingested issues |
read_limit | Optional[int] | Maximum number of issues from GitHub repository to be read, defaults to 10 |
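
When you start from a repository URL rather than separate values, the owner and name can be split out with the standard library. This is a small sketch; `split_github_url` is a hypothetical helper and not part of the Graphlit Agent Tools.

```python
from urllib.parse import urlsplit


def split_github_url(url):
    """Split a GitHub repository URL into (repository_owner, repository_name)."""
    parts = urlsplit(url)
    if parts.netloc.lower() not in ("github.com", "www.github.com"):
        raise ValueError(f"Not a GitHub URL: {url}")
    # Drop empty segments produced by leading or trailing slashes.
    segments = [segment for segment in parts.path.split("/") if segment]
    if len(segments) < 2:
        raise ValueError(f"URL does not name a repository: {url}")
    owner, name = segments[0], segments[1]
    # Clone URLs often end in ".git"; drop that suffix from the name.
    if name.endswith(".git"):
        name = name[: -len(".git")]
    return owner, name


owner, name = split_github_url("https://github.com/openai/tiktoken")
print(owner, name)  # openai tiktoken
```

The two returned values correspond to the repository_owner and repository_name parameters in the table above.
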
Ingests issues from Atlassian Jira into knowledge base. Accepts Atlassian Jira server URL and project name. Returns extracted Markdown text and metadata from issues.
Requires JIRA_TOKEN and JIRA_EMAIL to be assigned as environment variables.
Name | Type | Description |
---|---|---|
url | str | Atlassian Jira server URL |
project | str | Atlassian Jira project name |
search | Optional[str] | Text to search for within ingested issues |
read_limit | Optional[int] | Maximum number of issues from Jira project to be read, defaults to 10 |
Ingests issues from Linear project into knowledge base. Accepts Linear project name. Returns extracted Markdown text and metadata from issues.
Requires LINEAR_API_KEY to be assigned as environment variable.
Name | Type | Description |
---|---|---|
project | str | Linear project name |
search | Optional[str] | Text to search for within ingested issues |
read_limit | Optional[int] | Maximum number of issues from Linear project to be read, defaults to 10 |
Ingests messages from Microsoft Teams channel into knowledge base. Returns extracted Markdown text and metadata from messages.
Requires MICROSOFT_TEAMS_CLIENT_ID, MICROSOFT_TEAMS_CLIENT_SECRET and MICROSOFT_TEAMS_REFRESH_TOKEN to be assigned as environment variables.
Name | Type | Description |
---|---|---|
team_name | str | Microsoft Teams team name |
channel_name | str | Microsoft Teams channel name |
search | Optional[str] | Text to search for within ingested messages |
read_limit | Optional[int] | Maximum number of messages from Microsoft Teams channel to be read, defaults to 10 |
Ingests messages from Discord channel into knowledge base. Accepts Discord channel name. Returns extracted Markdown text and metadata from messages.
Requires DISCORD_BOT_TOKEN to be assigned as environment variable.
Name | Type | Description |
---|---|---|
channel_name | str | Discord channel name |
search | Optional[str] | Text to search for within ingested messages |
read_limit | Optional[int] | Maximum number of messages from Discord channel to be read, defaults to 10 |
Ingests messages from Slack channel into knowledge base. Accepts Slack channel name. Returns extracted Markdown text and metadata from messages.
Requires SLACK_BOT_TOKEN to be assigned as environment variable.
Name | Type | Description |
---|---|---|
channel_name | str | Slack channel name |
search | Optional[str] | Text to search for within ingested messages |
read_limit | Optional[int] | Maximum number of messages from Slack channel to be read, defaults to 10 |
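
Several of the ingestion tools above require their own credentials as environment variables. The sketch below gathers those requirements in one place and reports which services are fully configured. The variable names come from the descriptions above; the service labels and the `configured_connectors` helper are hypothetical and only for illustration.

```python
import os

# Environment variables each service requires, as stated in the tool descriptions.
CONNECTOR_VARIABLES = {
    "Notion": ["NOTION_API_KEY"],
    "Microsoft Email": [
        "MICROSOFT_EMAIL_CLIENT_ID",
        "MICROSOFT_EMAIL_CLIENT_SECRET",
        "MICROSOFT_EMAIL_REFRESH_TOKEN",
    ],
    "Google Email": [
        "GOOGLE_EMAIL_CLIENT_ID",
        "GOOGLE_EMAIL_CLIENT_SECRET",
        "GOOGLE_EMAIL_REFRESH_TOKEN",
    ],
    "GitHub": ["GITHUB_PERSONAL_ACCESS_TOKEN"],
    "Jira": ["JIRA_TOKEN", "JIRA_EMAIL"],
    "Linear": ["LINEAR_API_KEY"],
    "Microsoft Teams": [
        "MICROSOFT_TEAMS_CLIENT_ID",
        "MICROSOFT_TEAMS_CLIENT_SECRET",
        "MICROSOFT_TEAMS_REFRESH_TOKEN",
    ],
    "Discord": ["DISCORD_BOT_TOKEN"],
    "Slack": ["SLACK_BOT_TOKEN"],
}


def configured_connectors(env=None):
    """Return the services whose required variables are all set and non-empty."""
    env = os.environ if env is None else env
    return [
        service
        for service, names in CONNECTOR_VARIABLES.items()
        if all(env.get(name) for name in names)
    ]


if __name__ == "__main__":
    print("Configured services:", configured_connectors())
```

A service that needs several variables is listed only when every one of them is present, so a partially configured account does not appear as ready.
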
Accepts user prompt as string. Prompts LLM with relevant content and returns completion from RAG pipeline. Returns Markdown text from LLM completion. Uses vector embeddings and similarity search to retrieve relevant content from knowledge base. Can search through web pages, PDFs, audio transcripts, and other unstructured data.
Name | Type | Description |
---|---|---|
prompt | str | Text prompt which is provided to LLM for completion, via RAG pipeline |
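
For readers unfamiliar with similarity search, the sketch below shows the general idea with toy three-dimensional vectors and cosine similarity. It is a conceptual illustration only: the vectors, document names, and helper functions are made up, and nothing here describes how Graphlit computes embeddings or ranks content.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_matches(query_vector, documents, limit=3):
    """Rank document names by similarity to the query, most similar first."""
    ranked = sorted(
        documents,
        key=lambda name: cosine_similarity(query_vector, documents[name]),
        reverse=True,
    )
    return ranked[:limit]


# Toy embeddings; real embeddings have hundreds or thousands of dimensions.
documents = {
    "pricing page": [0.9, 0.1, 0.0],
    "podcast transcript": [0.1, 0.8, 0.3],
    "release notes": [0.7, 0.3, 0.1],
}
print(top_matches([1.0, 0.2, 0.0], documents, limit=2))
# ['pricing page', 'release notes']
```

In a RAG pipeline, the best-matching content is then supplied to the LLM together with the prompt.
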
Accepts search text as string. Optionally accepts a list of content types (i.e. FILE, PAGE, EMAIL, ISSUE, MESSAGE) for filtering the result set. Retrieves contents based on similarity search from knowledge base. Returns extracted Markdown text and metadata from contents relevant to the search text. Can search through web pages, PDFs, audio transcripts, Slack messages, emails, or any unstructured data ingested into the knowledge base.
Name | Type | Description |
---|---|---|
text | str | Text to search for within the knowledge base |
types | Optional[List[ContentTypes]] | List of content types (i.e. FILE, PAGE, EMAIL, ISSUE, MESSAGE) to be returned from knowledge base |
limit | Optional[int] | Number of contents to return from search query |
Accepts search text as string. Retrieves persons based on similarity search from knowledge base. Returns metadata from persons relevant to the search text.
Name | Type | Description |
---|---|---|
search | str | Text to search for within the knowledge base |
limit | Optional[int] | Number of persons to return from search query |
Accepts search text as string. Retrieves organizations based on similarity search from knowledge base. Returns metadata from organizations relevant to the search text.
Name | Type | Description |
---|---|---|
search | str | Text to search for within the knowledge base |
limit | Optional[int] | Number of organizations to return from search query |
Accepts image URL as string. Prompts vision LLM and returns completion. Returns Markdown text from LLM completion.
Name | Type | Description |
---|---|---|
url | str | URL for image to be described with vision LLM |
prompt | str | Text prompt which is provided to vision LLM for completion |
Screenshots web page from URL and describes web page with vision LLM. Returns Markdown description of screenshot and extracted Markdown text from image.
Name | Type | Description |
---|---|---|
url | str | URL of web page to screenshot and ingest into knowledge base |
prompt | Optional[str] | Text prompt which is provided to vision LLM for screenshot description |
Accepts text as string. Optionally accepts text prompt to be provided to LLM for text summarization. Returns summary as text.
Name | Type | Description |
---|---|---|
text | str | Text to be summarized |
prompt | Optional[str] | Text prompt which is provided to LLM for text summarization |
Accepts text as string. Optionally accepts the count of bullet points to be generated. Returns bullet points as text.
Name | Type | Description |
---|---|---|
text | str | Text to be summarized into bullet points |
count | Optional[int] | Number of bullet points to be generated |
Accepts text as string. Optionally accepts the count of headlines to be generated. Returns headlines as text.
Name | Type | Description |
---|---|---|
text | str | Text to be summarized into headlines |
count | Optional[int] | Number of headlines to be generated |
Accepts text as string. Optionally accepts the count of social media posts to be generated. Returns social media posts as text.
Name | Type | Description |
---|---|---|
text | str | Text to be summarized into social media posts |
count | Optional[int] | Number of social media posts to be generated |
Accepts text as string. Optionally accepts the count of followup questions to be generated. Returns followup questions as text.
Name | Type | Description |
---|---|---|
text | str | Text to be summarized into followup questions |
count | Optional[int] | Number of followup questions to be generated |
Accepts text as string. Optionally accepts the count of keywords to be generated. Returns keywords as text.
Name | Type | Description |
---|---|---|
text | str | Text to be summarized into keywords |
count | Optional[int] | Number of keywords to be generated |
Accepts transcript as string. Returns chapters as text.
Name | Type | Description |
---|---|---|
text | str | Transcript to be summarized into chapters. Assumes transcript contains time-stamped text. |
Extracts JSON data from ingested file using LLM. Accepts URL to be ingested, and JSON schema of Pydantic model to be extracted into. JSON schema needs to be of type 'object' and include 'properties' and 'required' fields. Returns extracted JSON from file.
Name | Type | Description |
---|---|---|
uri | str | URL of cloud-hosted file to be ingested into knowledge base |
model_schema | str | Pydantic model JSON schema which describes the data which will be extracted. JSON schema needs to be of type 'object' and include 'properties' and 'required' fields. |
prompt | Optional[str] | Text prompt which is provided to LLM to guide data extraction |
Extracts JSON data from ingested web page using LLM. Accepts URL to be scraped, and JSON schema of Pydantic model to be extracted into. JSON schema needs to be of type 'object' and include 'properties' and 'required' fields. Returns extracted JSON from web page.
Name | Type | Description |
---|---|---|
uri | str | URL of web page to be scraped and ingested into knowledge base |
model_schema | str | Pydantic model JSON schema which describes the data which will be extracted. JSON schema needs to be of type 'object' and include 'properties' and 'required' fields. |
prompt | Optional[str] | Text prompt which is provided to LLM to guide data extraction |
Extracts JSON data from text using LLM. Accepts text to extract data from, and JSON schema of Pydantic model to be extracted into. JSON schema needs to be of type 'object' and include 'properties' and 'required' fields. Returns extracted JSON from text.
Name | Type | Description |
---|---|---|
text | str | Text to be extracted with LLM |
model_schema | str | Pydantic model JSON schema which describes the data which will be extracted. JSON schema needs to be of type 'object' and include 'properties' and 'required' fields. |
prompt | Optional[str] | Text prompt which is provided to LLM to guide data extraction |
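
The three extraction tools above take model_schema as a string. The sketch below builds such a string by hand and checks the three stated constraints (type 'object', 'properties', 'required'). The Product schema and the `is_extraction_schema` helper are hypothetical examples, not part of the Graphlit Agent Tools.

```python
import json

# Hand-written schema for a hypothetical Product model.
product_schema = {
    "title": "Product",
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "price": {"type": "number"},
    },
    "required": ["name"],
}


def is_extraction_schema(schema_text):
    """Check the stated constraints: 'object' type, 'properties', and 'required'."""
    try:
        schema = json.loads(schema_text)
    except json.JSONDecodeError:
        return False
    return (
        isinstance(schema, dict)
        and schema.get("type") == "object"
        and isinstance(schema.get("properties"), dict)
        and isinstance(schema.get("required"), list)
    )


# The tools expect the schema serialized as a string.
model_schema = json.dumps(product_schema)
print(is_extraction_schema(model_schema))  # True
```

If you define the model with Pydantic v2, `model_json_schema()` returns a dictionary of this shape for a model with at least one required field, and `json.dumps` turns it into the string form.
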
Please refer to the Graphlit API Documentation.
For support with the Graphlit Agent Tools or to request an additional tool, please submit a GitHub Issue.
For further support with the Graphlit Platform, please join our Discord community.