diff --git a/docs/articles/rag-application-communication-system.md b/docs/articles/rag-application-communication-system.md index 1ba551b06..7de6a2fbb 100644 --- a/docs/articles/rag-application-communication-system.md +++ b/docs/articles/rag-application-communication-system.md @@ -95,14 +95,12 @@ In practical terms: In this time of LLMs, older information retrieval methods and indicators continue to hold a lot of unrealized value, especially now that it's possible to generate/extract many key data features at scale. Jo Kristian Bergum from Vespa, for example, has [convincingly demonstrated](https://blog.vespa.ai/improving-retrieval-with-llm-as-a-judge/) how classic info retrieval evaluation design and metrics (precision at k, recall) can be effectively repurposed using emerging practices in AI, such as LLM-as-a-Judge - grounded on a small but scalable relevant dataset. Intensive data work that would have been available only to large scale organizations is now scalable with far fewer resources. -