Currently, AutoRAG does not allow multiple variations of a metric within a single experiment.
Examples:
Rouge - the user has to choose one of Rouge-1, Rouge-2, Rouge-L, or Rouge-Lsum (the types supported by RougeScorer)
BLEU - each n-gram order
Sem score - what if the user wants to compare several embedding models?
How should we handle this?
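One possible direction, sketched here only as a starting point for discussion: let each metric entry be either a plain name or a dict carrying parameters, and expand every parameter list into one named variant per combination. The `metric_name` key, the `expand_metric_variants` helper, and the parameter names `rouge_type` and `max_ngram_order` are all hypothetical and are not part of the current AutoRAG API.

```python
from itertools import product


def expand_metric_variants(metric_specs):
    """Expand metric specs into (variant_name, params) pairs.

    A spec is either a plain metric name (str) or a dict with a
    hypothetical "metric_name" key plus parameters. A parameter given
    as a list produces one variant per value.
    """
    variants = []
    for spec in metric_specs:
        if isinstance(spec, str):
            # Plain name: a single variant with default parameters.
            variants.append((spec, {}))
            continue
        name = spec["metric_name"]
        # Every other key is a parameter; wrap scalars so all values are lists.
        grid = {
            key: value if isinstance(value, list) else [value]
            for key, value in spec.items()
            if key != "metric_name"
        }
        keys = list(grid)
        for combo in product(*(grid[key] for key in keys)):
            params = dict(zip(keys, combo))
            suffix = ",".join(f"{k}={v}" for k, v in params.items())
            variants.append((f"{name}[{suffix}]" if suffix else name, params))
    return variants


# Hypothetical config: two Rouge types and two BLEU n-gram orders in one run.
specs = [
    "meteor",
    {"metric_name": "rouge", "rouge_type": ["rouge1", "rougeL"]},
    {"metric_name": "bleu", "max_ngram_order": [2, 4]},
]
variants = expand_metric_variants(specs)
for variant_name, params in variants:
    print(variant_name, params)
```

Each variant gets its own name, so results for `rouge[rouge_type=rouge1]` and `rouge[rouge_type=rougeL]` could be stored as separate columns. The same shape would cover sem score by listing several embedding models under one parameter.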