Examples¶
Detailed, runnable examples are available as Jupyter Notebooks in the examples
folder:
-
Using GAICo's
Experiment
module to provide a simple, quickstart workflow. -
Evaluating multiple models (LLMs, Google, and Custom) using a single metric.
-
Evaluating a single model on multiple metrics.
-
The aim for this notebook was to aid with evaluating DeepSeek R1 for AI4Society's Point of View (POV).
Advanced Examples¶
The advanced-examples
directory contains advances notebooks showcasing more complex use cases and metrics. These examples are intended for users who are already familiar with the basics of GAICo. Please refer to the README.md file in that directory for details. A quick description:
-
Comparison of various LLM responses (Phi, Mixtral, etc.) on FAQ dataset from USC.
-
Exploration of default and custom thresholding techniques for LLM responses.
-
Hands-on visualizations for LLM results.