Logan/merge next #10676

Merged
merged 421 commits into from
Feb 16, 2024
Changes from 250 commits
e397569
wip
Disiok Nov 3, 2023
5d88b59
wip
Disiok Nov 3, 2023
aa749a2
Merge pull request #20 from run-llama/logan/token_counting_fixes
Disiok Nov 3, 2023
7087fc1
wip
Disiok Nov 3, 2023
8c5efc6
fix lint
Disiok Nov 3, 2023
e000268
fix lint
Disiok Nov 3, 2023
a01511b
wip
Disiok Nov 3, 2023
96888c7
remove test file
Disiok Nov 3, 2023
91b8e46
wip
Disiok Nov 3, 2023
e41a7c9
Merge pull request #22 from run-llama/suo/merge_main
Disiok Nov 3, 2023
92f1c5f
wip
Disiok Nov 4, 2023
2db67dd
Merge pull request #23 from run-llama/suo/update_client
Disiok Nov 4, 2023
ab15f1e
s/textnodeparser/textsplitter etc
yisding Nov 8, 2023
874745f
openai version for ci
yisding Nov 8, 2023
27f41fb
linting
logan-markewich Nov 8, 2023
14dcdf9
imports
logan-markewich Nov 8, 2023
d5a502c
more linting
logan-markewich Nov 9, 2023
c44ac1d
add pipeline caching
logan-markewich Nov 9, 2023
8a67f1e
Merge pull request #24 from run-llama/revert_text_splitter_naming
Disiok Nov 9, 2023
2bd48ed
wip
Disiok Nov 9, 2023
454d608
wip
Disiok Nov 9, 2023
62ea9a0
wip
Disiok Nov 9, 2023
212c68f
Merge pull request #28 from run-llama/suo/next_merge_origin_main
Disiok Nov 9, 2023
353a520
fix tests
logan-markewich Nov 9, 2023
d8048fc
fix linting
logan-markewich Nov 9, 2023
f4292b2
more linting
logan-markewich Nov 9, 2023
704db96
add cache clear function
logan-markewich Nov 9, 2023
71cb914
add comments about keys_to_remove
logan-markewich Nov 9, 2023
6f1812a
simplify unstable value search
logan-markewich Nov 9, 2023
0133b5d
simplify to/from cache json
logan-markewich Nov 9, 2023
bd70866
Merge pull request #27 from run-llama/logan/add_transform_cache
logan-markewich Nov 9, 2023
5b95ade
use platform api key
Nov 10, 2023
54218b5
use random names for data sources
Disiok Nov 13, 2023
3319c1b
Merge pull request #29 from run-llama/javi/framework-api-key
Disiok Nov 13, 2023
3d79e16
Merge pull request #30 from run-llama/suo/fix_dup_doc
Disiok Nov 13, 2023
4c5695a
make default auth key None
Disiok Nov 13, 2023
c4b069a
wip
Disiok Nov 13, 2023
9f436a1
Merge pull request #31 from run-llama/suo/no_auth_option
Disiok Nov 13, 2023
cf6e44b
add dataset upload
logan-markewich Nov 13, 2023
f375619
api name changes
logan-markewich Nov 13, 2023
52f2c64
remove RawFile as data source type
sourabhdesai Nov 13, 2023
d982789
Merge pull request #33 from run-llama/sour/remove_raw_file
Disiok Nov 13, 2023
5bae568
update naming
logan-markewich Nov 13, 2023
e34bd33
Merge pull request #32 from run-llama/logan/add_dataset_upload
logan-markewich Nov 13, 2023
468f0d4
New client again
logan-markewich Nov 13, 2023
8fd796e
Merge pull request #34 from run-llama/logan/new_new_client
Disiok Nov 14, 2023
ce6e065
check for duplicate datasets
logan-markewich Nov 15, 2023
bbc6b41
enum update
logan-markewich Nov 15, 2023
85abdc5
wip
Disiok Nov 15, 2023
1e8affc
support both append and overwrite
logan-markewich Nov 15, 2023
43fda20
appease the typing gods
logan-markewich Nov 15, 2023
35195a6
wip
Disiok Nov 15, 2023
fa4ab3e
Update transformations.py
Disiok Nov 16, 2023
faea6a4
merge
Disiok Nov 16, 2023
f2df259
Merge branch 'logan/dataset_and_enums' of github.com:run-llama/llama_…
Disiok Nov 16, 2023
bd40980
Merge pull request #37 from run-llama/logan/dataset_and_enums
Disiok Nov 16, 2023
89c4dd5
Support from_documents upload
logan-markewich Nov 16, 2023
4bb793e
Support from_documents upload
logan-markewich Nov 16, 2023
2f5d711
run_transformations import
logan-markewich Nov 16, 2023
0103cb6
linting
logan-markewich Nov 16, 2023
f73bb9d
linting
logan-markewich Nov 16, 2023
c3a8c42
rip out client
logan-markewich Nov 16, 2023
3f1e8e6
comments
logan-markewich Nov 16, 2023
686e788
bump client version
logan-markewich Nov 16, 2023
8ce462c
lock
logan-markewich Nov 16, 2023
6aca6aa
linting
logan-markewich Nov 16, 2023
bf159ba
linting
logan-markewich Nov 16, 2023
99108b2
Merge pull request #38 from run-llama/logan/from_documents
logan-markewich Nov 16, 2023
5bf9f5a
Merge branch 'next' into logan/remove_client
logan-markewich Nov 16, 2023
afcd671
wip
Disiok Nov 17, 2023
eeeb707
Merge pull request #40 from run-llama/suo/update_url
Disiok Nov 17, 2023
3323d34
merge
Disiok Nov 17, 2023
ad2d3c2
fix imports
logan-markewich Nov 17, 2023
0f33702
Merge pull request #39 from run-llama/logan/remove_client
logan-markewich Nov 17, 2023
d736f39
wip
Disiok Nov 21, 2023
2b02992
Merge pull request #41 from run-llama/suo/fix_global
Disiok Nov 21, 2023
242466d
wip
Disiok Nov 21, 2023
e191d4d
add DocumentGroup class
sourabhdesai Nov 22, 2023
245fc3a
set default DocumentGroup data source name to file path
sourabhdesai Nov 22, 2023
1dbac29
merge main
logan-markewich Nov 22, 2023
b5f0e64
linting
logan-markewich Nov 22, 2023
24c63f7
Merge pull request #42 from run-llama/suo/eval_with_feedback
Disiok Nov 22, 2023
24e58e7
poetry
logan-markewich Nov 22, 2023
02a52a6
tests
logan-markewich Nov 22, 2023
07d8d40
linting
logan-markewich Nov 22, 2023
be14919
Merge pull request #44 from run-llama/logan/merge_main_again
logan-markewich Nov 22, 2023
6586c3d
Merge pull request #43 from run-llama/sour/doc_groups
Disiok Nov 23, 2023
d7a8393
wip
Disiok Nov 23, 2023
f96dad5
Merge pull request #45 from run-llama/suo/fix_bug_again_after_merge
Disiok Nov 23, 2023
79abea8
improve openai key validation
logan-markewich Nov 27, 2023
29b53b6
merge main
logan-markewich Nov 27, 2023
0e70623
remove data_sinks, allow multiple readers
logan-markewich Nov 27, 2023
bc33d03
revert vector store typing
logan-markewich Nov 27, 2023
2fa6264
tests
logan-markewich Nov 27, 2023
eeb3414
linting
logan-markewich Nov 27, 2023
081b7b7
Merge pull request #47 from run-llama/logan/merge_latest_main
logan-markewich Nov 27, 2023
0309689
fix uplaod_eval_dataset
logan-markewich Nov 28, 2023
76a276b
Merge pull request #48 from run-llama/logan/fix_upload_eval_dataset
logan-markewich Nov 28, 2023
9f00aa5
add back data sinks
logan-markewich Nov 28, 2023
8103890
Merge pull request #49 from run-llama/logan/bring_back_data_sink_enums
logan-markewich Nov 28, 2023
976659b
upload empty data sinks
logan-markewich Nov 28, 2023
f48eded
bump client version
logan-markewich Nov 28, 2023
95613e7
remove unsupported transformations
logan-markewich Nov 29, 2023
8bf4e89
fix tests
logan-markewich Nov 29, 2023
8a4b434
Merge pull request #50 from run-llama/logan/remove_enums
logan-markewich Nov 29, 2023
2558a67
Fix test
logan-markewich Nov 29, 2023
533060e
merge main
logan-markewich Nov 30, 2023
0d4a2c9
specify alpha version
logan-markewich Nov 30, 2023
33220ed
fix bad merge
logan-markewich Nov 30, 2023
4be7fe7
Merge pull request #51 from run-llama/logan/alpha_release_version
logan-markewich Nov 30, 2023
ff5654f
enable integration tests in CI
logan-markewich Nov 30, 2023
3641c58
use proper unit test make
logan-markewich Nov 30, 2023
8a0b2f1
add eval dataset test
logan-markewich Nov 30, 2023
aeb9f17
fix eval tests
logan-markewich Nov 30, 2023
c0dfe3d
fix eval tests
logan-markewich Nov 30, 2023
a4d83ec
linting
logan-markewich Nov 30, 2023
345b956
more linting for managed
logan-markewich Nov 30, 2023
2cc636a
linting..
logan-markewich Nov 30, 2023
84b0f28
typo
logan-markewich Nov 30, 2023
81ba8f4
make eval test less flakey
logan-markewich Nov 30, 2023
6ebce5f
unique project names
logan-markewich Dec 1, 2023
0c180b9
Merge pull request #52 from run-llama/logan/activate_integration_tests
logan-markewich Dec 1, 2023
672b017
add reuse_client option to openai
logan-markewich Dec 1, 2023
8099301
typo
logan-markewich Dec 1, 2023
dbb1332
linting
logan-markewich Dec 1, 2023
1e1426e
azure fix
logan-markewich Dec 1, 2023
4250e4e
update client
logan-markewich Dec 1, 2023
c2a997d
bump package alpha version
logan-markewich Dec 1, 2023
f730a38
version
logan-markewich Dec 1, 2023
6d5caba
poetry lock
logan-markewich Dec 1, 2023
29bf8ea
Merge pull request #54 from run-llama/logan/update_client
logan-markewich Dec 1, 2023
05de3bf
Merge pull request #53 from run-llama/logan/do_not_share_openai_client
logan-markewich Dec 2, 2023
62a1371
merge main
logan-markewich Dec 6, 2023
0d51629
fix empty embeddings
logan-markewich Dec 6, 2023
fb1cbdf
vesrion bump
logan-markewich Dec 6, 2023
8b4c20e
fix imports
logan-markewich Dec 6, 2023
21db54b
fix tests
logan-markewich Dec 6, 2023
47da569
fix tests
logan-markewich Dec 6, 2023
b6bd0c4
Merge pull request #55 from run-llama/logan/updates
logan-markewich Dec 6, 2023
8279108
fix empty docs
logan-markewich Dec 6, 2023
0f42618
add download of dataset from hub
nerdai Dec 14, 2023
fe14e88
wip
nerdai Dec 14, 2023
0086752
add try/except block
nerdai Dec 14, 2023
5a4d636
fix from_pipeline_name
logan-markewich Dec 14, 2023
b18b0dd
kwarg
logan-markewich Dec 14, 2023
2bfd4e2
Merge pull request #56 from run-llama/nerdai/update_upload_eval_dataset
logan-markewich Dec 14, 2023
71f519f
enable more embedding models
logan-markewich Dec 14, 2023
d6d1f69
update URLs
logan-markewich Dec 14, 2023
ea6490a
remove bedrock
logan-markewich Dec 14, 2023
6cd6579
remove slash
logan-markewich Dec 14, 2023
7a4a377
typo
logan-markewich Dec 14, 2023
111bf28
Merge pull request #57 from run-llama/logan/expose_embeddings
logan-markewich Dec 14, 2023
970ad54
version bump
logan-markewich Dec 14, 2023
748dc7f
bump dependencies
logan-markewich Dec 14, 2023
f626929
alpha version bump
logan-markewich Dec 14, 2023
50beec1
dont exclude api key
logan-markewich Dec 19, 2023
a196eb8
[version] bump to a6
logan-markewich Dec 19, 2023
6409b6e
Merge pull request #58 from run-llama/logan/revert_api_key_change
logan-markewich Dec 19, 2023
33ff51e
merge main
logan-markewich Jan 3, 2024
2aabd07
bump version
logan-markewich Jan 3, 2024
9832ac4
Make schema() work again
logan-markewich Jan 3, 2024
118b384
fix typing
logan-markewich Jan 4, 2024
844b129
fix llm loading
logan-markewich Jan 4, 2024
1dc74ff
update client version
logan-markewich Jan 4, 2024
711335e
fix test
logan-markewich Jan 4, 2024
5d422b6
update pipeline
logan-markewich Jan 4, 2024
2333c9e
fix from_pipeline_name test
logan-markewich Jan 4, 2024
88b5629
small qdrant nit
logan-markewich Jan 4, 2024
21d325b
linting
logan-markewich Jan 4, 2024
1cffcdf
Merge pull request #60 from run-llama/logan/merge_0.9.25
logan-markewich Jan 4, 2024
268d46f
version bump
logan-markewich Jan 4, 2024
f8b28f2
make llm public attribute
logan-markewich Dec 7, 2023
90df545
Revert "make llm public attribute"
logan-markewich Jan 17, 2024
c3e5eff
merge main
logan-markewich Jan 17, 2024
b984ab2
bump version
logan-markewich Jan 17, 2024
f980e26
fix some tests
logan-markewich Jan 17, 2024
aec4954
bump client version
logan-markewich Jan 18, 2024
2e4a6d6
Merge pull request #61 from run-llama/logan/merge_main_0.9.32
logan-markewich Jan 18, 2024
edc83ec
update test slightly
logan-markewich Jan 18, 2024
e721bd5
[version] bump to v0.9.33a3
logan-markewich Jan 18, 2024
8dc86b9
Merge branch 'main' into logan/merge_main_v0.9.33a2
logan-markewich Jan 18, 2024
e692748
version bump
logan-markewich Jan 18, 2024
4790203
Merge pull request #62 from run-llama/logan/merge_main_v0.9.33a2
logan-markewich Jan 18, 2024
c286555
wip
logan-markewich Jan 18, 2024
abc2d35
janky defaults
logan-markewich Jan 18, 2024
95a27c2
fix defaults + pipeline name
logan-markewich Jan 19, 2024
dd2d7c8
temp fix
logan-markewich Jan 19, 2024
e664434
poll
Disiok Jan 21, 2024
2e71493
wip
Disiok Jan 21, 2024
3807942
wip
Disiok Jan 21, 2024
81f5a28
wip
Disiok Jan 21, 2024
84f7a64
wip
Disiok Jan 22, 2024
d2527cd
wip
Disiok Jan 22, 2024
eeb4314
rename module
Disiok Jan 22, 2024
e8f74ba
Merge pull request #63 from run-llama/logan/platform_index
Disiok Jan 22, 2024
c6fcc4c
bump version to v0.9.33a4
logan-markewich Feb 2, 2024
b142515
update run_remote()
logan-markewich Feb 2, 2024
fa95936
use cloud URL
logan-markewich Feb 7, 2024
5d69764
rename url and api key vars
logan-markewich Feb 7, 2024
e21a385
make imports easier
logan-markewich Feb 7, 2024
a81c4c3
make end to end work
logan-markewich Feb 7, 2024
640668b
disable auto upload feature
Disiok Feb 10, 2024
519de83
wip
Disiok Feb 10, 2024
196b5f5
fix env var
Disiok Feb 10, 2024
02767cc
update lock
Disiok Feb 10, 2024
46cee9c
Merge pull request #64 from run-llama/logan/final_fixes
Disiok Feb 10, 2024
e2a30ea
wip
Disiok Feb 10, 2024
b9c1cf0
Merge pull request #65 from run-llama/suo/bump_version
Disiok Feb 10, 2024
066d97d
Make errors more informative, use proper app url
logan-markewich Feb 13, 2024
df3e17b
Merge pull request #66 from run-llama/logan/final_fixes_for_launch
logan-markewich Feb 13, 2024
a1ea4b3
wip
logan-markewich Feb 13, 2024
9f55b81
remove core modules from docs
logan-markewich Feb 13, 2024
046be20
remove unused makefile command
logan-markewich Feb 13, 2024
01576a8
remove end-to-end tutorials
logan-markewich Feb 13, 2024
9f99ebe
fix var name
logan-markewich Feb 13, 2024
6265c2a
clean up docs changes
logan-markewich Feb 13, 2024
bb7ca9a
update notebooks
logan-markewich Feb 13, 2024
78b39b2
fix legacy changes, add constants
logan-markewich Feb 13, 2024
b057218
fix changed test
logan-markewich Feb 13, 2024
befe907
wip
logan-markewich Feb 14, 2024
df4c4c1
wip
logan-markewich Feb 14, 2024
8d95edd
wip
logan-markewich Feb 14, 2024
ac41683
wip
logan-markewich Feb 14, 2024
6ab297d
wip
logan-markewich Feb 14, 2024
d32b473
wip
logan-markewich Feb 14, 2024
814c6c5
wip
logan-markewich Feb 14, 2024
fd4c406
wip
logan-markewich Feb 14, 2024
a0cfff6
wip
logan-markewich Feb 14, 2024
6ed82ed
wip
logan-markewich Feb 14, 2024
08d9ae3
wip
logan-markewich Feb 14, 2024
6d5d69b
bonus fix
logan-markewich Feb 14, 2024
9a62fbd
wip
logan-markewich Feb 14, 2024
1b6ba01
working commit
logan-markewich Feb 15, 2024
66de6eb
add llamaindex-py-client to deps
logan-markewich Feb 15, 2024
04e3da4
update retriever
logan-markewich Feb 15, 2024
db64f56
linting
logan-markewich Feb 15, 2024
3b3ff7c
address comments
logan-markewich Feb 15, 2024
eda9585
Merge branch 'main' into logan/merge_next
logan-markewich Feb 15, 2024
d53dadd
version bumps for sub packages
logan-markewich Feb 15, 2024
3a458ef
bring back skip tests for legacy
logan-markewich Feb 15, 2024
1955d99
remove unit test secrets
logan-markewich Feb 15, 2024
4c75a6b
no infer deps
logan-markewich Feb 15, 2024
d9e673d
more ignore deps
logan-markewich Feb 15, 2024
0859241
test caching
logan-markewich Feb 15, 2024
9e84e4f
Merge branch 'main' into logan/merge_next
logan-markewich Feb 15, 2024
f9967b6
update module mappings
logan-markewich Feb 15, 2024
7547b4d
bump client version to loosen httpx reqs
logan-markewich Feb 16, 2024
4ceea93
final fixes?
logan-markewich Feb 16, 2024
331d445
fix core tests
logan-markewich Feb 16, 2024
2b9cedb
fix circular imports and empty tests
logan-markewich Feb 16, 2024
2 changes: 2 additions & 0 deletions .github/workflows/unit_test.yml
@@ -8,6 +8,8 @@ on:

env:
POETRY_VERSION: "1.6.1"
LLAMA_CLOUD_BASE_URL: ${{ secrets.PLATFORM_BASE_URL }}
Collaborator

Do we need this? Assuming we are not running integration tests here, it seems extra.

Collaborator Author

Yeah, that's stray stuff. We can remove it for now.
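
For context on the question above: a minimal sketch, assuming integration tests are meant to run only when both variables from this workflow are available. The helper name `cloud_credentials_present` is hypothetical and not part of this PR. As far as I know, an unset GitHub Actions secret expands to an empty string, which is why the check treats empty values as missing.

```python
import os
from typing import Mapping, Optional

# Variable names taken from the workflow diff in this PR.
REQUIRED_VARS = ("LLAMA_CLOUD_BASE_URL", "LLAMA_CLOUD_API_KEY")


def cloud_credentials_present(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True only if every required variable is set and non-empty.

    Hypothetical helper for illustration; defaults to the process environment.
    """
    env = os.environ if env is None else env
    return all(env.get(name) for name in REQUIRED_VARS)
```

With pytest, a check like this could feed `pytest.mark.skipif(not cloud_credentials_present(), reason="...")`, so unit-test runs without the secrets skip the integration tests instead of failing.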

LLAMA_CLOUD_API_KEY: ${{ secrets.PLATFORM_API_KEY }}

jobs:
test:
116 changes: 44 additions & 72 deletions docs/examples/evaluation/faithfulness_eval.ipynb
@@ -19,12 +19,12 @@
"metadata": {},
"outputs": [],
"source": [
"%pip install llama-index-llms-openai"
"%pip install llama-index-llms-openai pandas[jinja2] spacy"
]
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 1,
"id": "4a8304f2",
"metadata": {},
"outputs": [],
@@ -37,28 +37,23 @@
},
{
"cell_type": "code",
"execution_count": null,
"id": "9080b39e",
"execution_count": 2,
"id": "190a6684",
"metadata": {},
"outputs": [],
"source": [
"# configuring logger to INFO level\n",
"import logging\n",
"import sys\n",
"\n",
"logging.basicConfig(stream=sys.stdout, level=logging.INFO)\n",
"logging.getLogger().addHandler(logging.StreamHandler(stream=sys.stdout))"
"import os\n",
"os.environ[\"OPENAI_API_KEY\"] = \"sk-...\""
]
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 3,
"id": "8d0b2364-4806-4656-81e7-3f6e4b910b5b",
"metadata": {},
"outputs": [],
"source": [
"from llama_index.core import (\n",
" TreeIndex,\n",
" VectorStoreIndex,\n",
" SimpleDirectoryReader,\n",
" Response,\n",
@@ -81,7 +76,7 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 4,
"id": "b9b98f89-d5b8-4d29-92f6-ad76d5060e9f",
"metadata": {},
"outputs": [],
@@ -94,7 +89,7 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 5,
"id": "1298bbb4-c99e-431e-93ef-eb32c0a2fc2a",
"metadata": {},
"outputs": [],
@@ -104,7 +99,7 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 6,
"id": "41f0e53f-77a6-40d5-94ae-3f81b01af75c",
"metadata": {},
"outputs": [],
@@ -118,13 +113,15 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 7,
"id": "af730b2e-6949-4865-b7af-bb2bc60a9173",
"metadata": {},
"outputs": [],
"source": [
"from llama_index.core.evaluation import EvaluationResult\n",
"\n",
"# define jupyter display function\n",
"def display_eval_df(response: Response, eval_result: str) -> None:\n",
"def display_eval_df(response: Response, eval_result: EvaluationResult) -> None:\n",
" if response.source_nodes == []:\n",
" print(\"no response!\")\n",
" return\n",
@@ -133,6 +130,7 @@
" \"Response\": str(response),\n",
" \"Source\": response.source_nodes[0].node.text[:1000] + \"...\",\n",
" \"Evaluation Result\": \"Pass\" if eval_result.passing else \"Fail\",\n",
" \"Reasoning\": eval_result.feedback,\n",
" },\n",
" index=[0],\n",
" )\n",
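
The annotation change in this hunk (`str` to `EvaluationResult`) matches how the body uses the argument: it reads `eval_result.passing` and, with the new "Reasoning" column, `eval_result.feedback`, neither of which a `str` has. A minimal, dependency-free sketch of the row the function builds follows. `FakeEvalResult` and `build_eval_row` are hypothetical stand-ins for the real `EvaluationResult` and the pandas DataFrame, used only so the snippet runs without llama-index installed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FakeEvalResult:
    """Hypothetical stand-in exposing only the two fields the notebook reads."""

    passing: Optional[bool] = None
    feedback: Optional[str] = None


def build_eval_row(
    response_text: str, source_text: str, eval_result: FakeEvalResult
) -> dict:
    """Mirror the columns of the notebook's display table as a plain dict."""
    return {
        "Response": response_text,
        # The notebook truncates the first source node to 1000 characters.
        "Source": source_text[:1000] + "...",
        "Evaluation Result": "Pass" if eval_result.passing else "Fail",
        "Reasoning": eval_result.feedback,
    }
```

Note that a missing verdict (`passing=None`) is reported as "Fail" by this expression, the same as an explicit `False`.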
@@ -156,7 +154,7 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 8,
"id": "180a5d2e-9286-477b-9cd0-a5976d18d845",
"metadata": {},
"outputs": [],
@@ -168,40 +166,42 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 9,
"id": "c764b8b3-69b1-4ac8-b88b-3f9e204b8bfb",
"metadata": {},
"outputs": [
{
"data": {
"text/html": [
"<style type=\"text/css\">\n",
"#T_db5e8_row0_col0, #T_db5e8_row0_col1 {\n",
"#T_21e1a_row0_col0, #T_21e1a_row0_col1 {\n",
" inline-size: 600px;\n",
" overflow-wrap: break-word;\n",
"}\n",
"</style>\n",
"<table id=\"T_db5e8\">\n",
"<table id=\"T_21e1a\">\n",
" <thead>\n",
" <tr>\n",
" <th class=\"blank level0\" >&nbsp;</th>\n",
" <th id=\"T_db5e8_level0_col0\" class=\"col_heading level0 col0\" >Response</th>\n",
" <th id=\"T_db5e8_level0_col1\" class=\"col_heading level0 col1\" >Source</th>\n",
" <th id=\"T_db5e8_level0_col2\" class=\"col_heading level0 col2\" >Evaluation Result</th>\n",
" <th id=\"T_21e1a_level0_col0\" class=\"col_heading level0 col0\" >Response</th>\n",
" <th id=\"T_21e1a_level0_col1\" class=\"col_heading level0 col1\" >Source</th>\n",
" <th id=\"T_21e1a_level0_col2\" class=\"col_heading level0 col2\" >Evaluation Result</th>\n",
" <th id=\"T_21e1a_level0_col3\" class=\"col_heading level0 col3\" >Reasoning</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th id=\"T_db5e8_level0_row0\" class=\"row_heading level0 row0\" >0</th>\n",
" <td id=\"T_db5e8_row0_col0\" class=\"data row0 col0\" >New York City got its name from the English explorer Henry Hudson, who rediscovered New York Harbor in 1609 while searching for the Northwest Passage. He named the area New York after the Duke of York, who later became King James II of England.</td>\n",
" <td id=\"T_db5e8_row0_col1\" class=\"data row0 col1\" >He claimed the area for France and named it Nouvelle Angoulême (New Angoulême).A Spanish expedition, led by the Portuguese captain Estêvão Gomes sailing for Emperor Charles V, arrived in New York Harbor in January 1525 and charted the mouth of the Hudson River, which he named Río de San Antonio ('Saint Anthony's River').The Padrón Real of 1527, the first scientific map to show the East Coast of North America continuously, was informed by Gomes' expedition and labeled the northeastern United States as Tierra de Esteban Gómez in his honor.In 1609, the English explorer Henry Hudson rediscovered New York Harbor while searching for the Northwest Passage to the Orient for the Dutch East India Company.He proceeded to sail up what the Dutch would name the North River (now the Hudson River), named first by Hudson as the Mauritius after Maurice, Prince of Orange.Hudson's first mate described the harbor as \"a very good Harbour for all windes\" and the river as \"a mile broad\" and \"full of fish\".Hud...</td>\n",
" <td id=\"T_db5e8_row0_col2\" class=\"data row0 col2\" >Fail</td>\n",
" <th id=\"T_21e1a_level0_row0\" class=\"row_heading level0 row0\" >0</th>\n",
" <td id=\"T_21e1a_row0_col0\" class=\"data row0 col0\" >New York City got its name when it came under British control in 1664. It was renamed New York after King Charles II of England granted the lands to his brother, the Duke of York.</td>\n",
" <td id=\"T_21e1a_row0_col1\" class=\"data row0 col1\" >The city came under British control in 1664 and was renamed New York after King Charles II of England granted the lands to his brother, the Duke of York. The city was regained by the Dutch in July 1673 and was renamed New Orange for one year and three months; the city has been continuously named New York since November 1674. New York City was the capital of the United States from 1785 until 1790, and has been the largest U.S. city since 1790. The Statue of Liberty greeted millions of immigrants as they came to the U.S. by ship in the late 19th and early 20th centuries, and is a symbol of the U.S. and its ideals of liberty and peace. In the 21st century, New York City has emerged as a global node of creativity, entrepreneurship, and as a symbol of freedom and cultural diversity. The New York Times has won the most Pulitzer Prizes for journalism and remains the U.S. media's \"newspaper of record\". In 2019, New York City was voted the greatest city in the world in a survey of over 30,000 p...</td>\n",
" <td id=\"T_21e1a_row0_col2\" class=\"data row0 col2\" >Pass</td>\n",
" <td id=\"T_21e1a_row0_col3\" class=\"data row0 col3\" >YES</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n"
],
"text/plain": [
"<pandas.io.formats.style.Styler at 0x7f991be1bc40>"
"<pandas.io.formats.style.Styler at 0x169353d10>"
]
},
"metadata": {},
@@ -224,30 +224,31 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 10,
"id": "90a8cd4d",
"metadata": {},
"outputs": [
{
"name": "stdout",
"name": "stderr",
"output_type": "stream",
"text": [
"WARNING:llama_index.indices.service_context:chunk_size_limit is deprecated, please specify chunk_size instead\n",
"chunk_size_limit is deprecated, please specify chunk_size instead\n",
"chunk_size_limit is deprecated, please specify chunk_size instead\n"
"/Users/loganmarkewich/giant_change/llama_index/llama-index-core/llama_index/core/evaluation/dataset_generation.py:212: DeprecationWarning: Call to deprecated class DatasetGenerator. (Deprecated in favor of `RagDatasetGenerator` which should be used instead.)\n",
" return cls(\n",
"/Users/loganmarkewich/giant_change/llama_index/llama-index-core/llama_index/core/evaluation/dataset_generation.py:309: DeprecationWarning: Call to deprecated class QueryResponseDataset. (Deprecated in favor of `LabelledRagDataset` which should be used instead.)\n",
" return QueryResponseDataset(queries=queries, responses=responses_dict)\n"
]
},
{
"data": {
"text/plain": [
"['What is the population of New York City as of 2020?',\n",
" 'Which borough of New York City is home to the headquarters of the United Nations?',\n",
" 'How many languages are spoken in New York City, making it the most linguistically diverse city in the world?',\n",
" 'Who founded the trading post on Manhattan Island that would later become New York City?',\n",
" 'What was New York City named after in 1664?']"
" 'Which city is the second-largest in the United States?',\n",
" 'How many people live within 250 miles of New York City?',\n",
" 'What are the five boroughs of New York City?',\n",
" 'What is the gross metropolitan product of the New York metropolitan area?']"
]
},
"execution_count": null,
"execution_count": 10,
"metadata": {},
"output_type": "execute_result"
}
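
The stderr output in this hunk has the shape of Python's standard `warnings` output (file, line, category, message) rather than the logger output it replaces. A minimal sketch of how a `DeprecationWarning` like this can be captured and inspected in a test or notebook follows. `old_generator` is a hypothetical placeholder and not the library's `DatasetGenerator`.

```python
import warnings


def old_generator() -> str:
    """Hypothetical deprecated function used only for illustration."""
    warnings.warn(
        "old_generator is deprecated; use new_generator instead.",
        DeprecationWarning,
        stacklevel=2,
    )
    return "questions"


def call_and_collect_deprecations():
    """Call old_generator and return (result, list of deprecation messages)."""
    with warnings.catch_warnings(record=True) as caught:
        # Record the warning even if an active filter would otherwise hide it.
        warnings.simplefilter("always")
        result = old_generator()
    messages = [
        str(w.message)
        for w in caught
        if issubclass(w.category, DeprecationWarning)
    ]
    return result, messages
```

Capturing the warnings this way keeps them out of the saved cell output while still letting the caller assert on them.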
@@ -263,7 +264,7 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 11,
"id": "810ee913",
"metadata": {},
"outputs": [],
@@ -289,44 +290,14 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 12,
"id": "8a7ca4b8",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"INFO:openai:message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=35 request_id=b36e17a843c31e827f0b7034e603cf28 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=35 request_id=b36e17a843c31e827f0b7034e603cf28 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=35 request_id=b36e17a843c31e827f0b7034e603cf28 response_code=200\n",
"INFO:openai:message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=35 request_id=5acb726518065db9312da9f23beef411 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=35 request_id=5acb726518065db9312da9f23beef411 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=35 request_id=5acb726518065db9312da9f23beef411 response_code=200\n",
"INFO:openai:message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=46 request_id=4af43bfbe4e24fdae0ec33312ee7491e response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=46 request_id=4af43bfbe4e24fdae0ec33312ee7491e response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=46 request_id=4af43bfbe4e24fdae0ec33312ee7491e response_code=200\n",
"INFO:openai:message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=37 request_id=e30413546fe5f96d3890606767f2ec53 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=37 request_id=e30413546fe5f96d3890606767f2ec53 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=37 request_id=e30413546fe5f96d3890606767f2ec53 response_code=200\n",
"INFO:openai:message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=33 request_id=01f0a8dada4dae80c97a9a412f03b84f response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=33 request_id=01f0a8dada4dae80c97a9a412f03b84f response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/embeddings processing_ms=33 request_id=01f0a8dada4dae80c97a9a412f03b84f response_code=200\n",
"INFO:openai:message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=282 request_id=ed7b1f8ba68ae32b1d8e24e0d0764e86 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=282 request_id=ed7b1f8ba68ae32b1d8e24e0d0764e86 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=282 request_id=ed7b1f8ba68ae32b1d8e24e0d0764e86 response_code=200\n",
"INFO:openai:message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=820 request_id=b4532c6d665b6cfd644861ed69819cb9 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=820 request_id=b4532c6d665b6cfd644861ed69819cb9 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=820 request_id=b4532c6d665b6cfd644861ed69819cb9 response_code=200\n",
"INFO:openai:message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=847 request_id=4d9bbc71a95b7e0bb69a048e251772c8 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=847 request_id=4d9bbc71a95b7e0bb69a048e251772c8 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=847 request_id=4d9bbc71a95b7e0bb69a048e251772c8 response_code=200\n",
"INFO:openai:message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=952 request_id=d1657940d881929d500b1fddc46b5866 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=952 request_id=d1657940d881929d500b1fddc46b5866 response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=952 request_id=d1657940d881929d500b1fddc46b5866 response_code=200\n",
"INFO:openai:message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=1482 request_id=c4456f75580d227f846d3a044e5eef1b response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=1482 request_id=c4456f75580d227f846d3a044e5eef1b response_code=200\n",
"message='OpenAI API response' path=https://api.openai.com/v1/chat/completions processing_ms=1482 request_id=c4456f75580d227f846d3a044e5eef1b response_code=200\n",
"finished query\n",
"score: 5/5\n"
]
@@ -355,7 +326,8 @@
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3"
"pygments_lexer": "ipython3",
"version": "3.11.4"
}
},
"nbformat": 4,