Ledger benchmark · SEC filings · finance RAG

Measuring retrieval quality for finance documents.

This benchmark compares embedding and reranker stacks on SEC filing retrieval: which models surface the right evidence for DDQ-style questions, cross-company comparisons, and multi-hop finance analysis.

Best Overall

0.902
Gemini-emb-2 + cohere-4-pro
PAID/PAID

Best Free/Free

0.846
Qwen3-8B + raw
FREE/FREE

Best Free/Paid

0.837
Octen-8B + cohere-4-fast
FREE/PAID

Best Paid/Free

0.892
Gemini-emb-2 + zerank-2
PAID/FREE

Best Paid/Paid

0.902
Gemini-emb-2 + cohere-4-pro
PAID/PAID

Best 3-Hop

93%
Octen-8B + cohere-4-fast
FREE/PAID

What we are measuring

In finance RAG, the hard part is not generating a fluent answer — it is retrieving the exact filing evidence that supports the answer. This page measures retrieval quality before generation. Each row is an embedding model plus an optional reranker, scored on whether the correct SEC filing chunks appear near the top of the retrieval results.

6,151SEC filing chunks
10portfolio companies
88model/reranker rows
4cost buckets

Methodology

Corpus. Ten real SEC filings were downloaded for a synthetic private-equity portfolio and parsed into 6,151 retrieval chunks.

Questions. The benchmark covers single-company facts, negative/not-in-portfolio questions, fund-level questions, cross-company questions, and harder multi-hop finance queries.

Pipeline. Each query retrieves candidates from the vector index. Some runs stop at raw vector retrieval; others rerank the top candidates with local or paid rerankers before scoring the final top five.

MRR rewards systems that rank the first correct evidence chunk highest.

Hit@5 checks whether at least one relevant chunk appears in the final top five.

1-Hop / 2-Hop / 3-Hop measure recall on increasingly complex question types. The 2-hop and 3-hop columns are especially important for finance work because many DDQ and filing-analysis questions require evidence across multiple companies or sections.

Full benchmark table

Cost buckets: free/free: 42, free/paid: 31, paid/free: 9, paid/paid: 6
#EmbeddingRerankerCostMRRHit@51-Hop2-Hop3-Hop
1Gemini-emb-2cohere-4-proPAID/PAID0.902100%100%80%80%
2Gemini-emb-2zerank-2PAID/FREE0.89295%100%90%67%
3text-emb-3-smallcohere-4-proPAID/PAID0.892100%100%70%80%
4Gemini-emb-2cohere-4-fastPAID/PAID0.87795%100%80%67%
5Gemini-emb-2nemotron-rrPAID/FREE0.860100%100%80%73%
6text-emb-3-smallzerank-2PAID/FREE0.85095%90%90%53%
7Qwen3-8BrawFREE/FREE0.84691%83%71%67%
8Octen-8Bcohere-4-fastFREE/PAID0.83791%83%67%93%
9Qwen3-0.6Bcohere-4-fastFREE/PAID0.83091%83%71%53%
10Nemotron-1bzerank-2PAID/FREE0.82990%90%80%60%
11Qwen3-0.6Bcohere-4-proFREE/PAID0.82891%83%71%47%
12Octen-8Bcohere-4-proFREE/PAID0.82687%83%62%87%
13Qwen3-0.6Bcohere-fastFREE/PAID0.82687%83%76%47%
14Qwen3-8Bcohere-fastFREE/PAID0.82691%83%71%53%
15Qwen3-8Bcohere-4-proFREE/PAID0.82091%83%71%60%
16text-emb-3-smallnemotron-rrPAID/FREE0.818100%100%80%67%
17text-emb-3-smallcohere-4-fastPAID/PAID0.81495%100%90%67%
18Gemini-emb-2rawPAID/FREE0.808100%100%70%67%
19Nemotron-1bcohere-4-fastPAID/PAID0.80490%90%70%53%
20Nemotron-1bnemotron-rrPAID/FREE0.80490%90%90%73%
21Harrier-27bjina-v3FREE/FREE0.79787%75%59%47%
22Jina-v5bge-m3FREE/FREE0.79787%83%65%53%
23Octen-4Bbge-m3FREE/FREE0.79787%83%68%80%
24Qwen3-8Bcohere-4-fastFREE/PAID0.79791%83%71%60%
25Nemotron-1brawPAID/FREE0.79790%80%80%73%
26Nemotron-1bcohere-4-proPAID/PAID0.79790%90%70%67%
27Qwen3-4Bcohere-4-proFREE/PAID0.79591%83%51%53%
28Qwen3-0.6BrawFREE/FREE0.79391%83%62%60%
29Jina-v5rawFREE/FREE0.79183%83%67%33%
30Octen-4Bcohere-4-fastFREE/PAID0.79091%83%79%67%
31E5-base-v2rawFREE/FREE0.78990%80%90%73%
32Qwen3-0.6Bbge-m3FREE/FREE0.78687%83%79%47%
33Qwen3-8Bjina-v3FREE/FREE0.78691%75%67%67%
34Jina-v5cohere-fastFREE/PAID0.78491%83%67%53%
35Qwen3-4Bcohere-4-fastFREE/PAID0.78391%83%59%67%
36Octen-8BrawFREE/FREE0.78091%83%59%73%
37Qwen3-8Bbge-m3FREE/FREE0.77991%83%71%73%
38Octen-4Bcohere-4-proFREE/PAID0.77791%83%79%60%
39Harrier-27bcohere-4-proFREE/PAID0.77591%83%76%60%
40Jina-v5cohere-4-fastFREE/PAID0.77591%83%67%67%
41Octen-8Bcohere-fastFREE/PAID0.77587%83%65%67%
42Jina-v5cohere-4-proFREE/PAID0.77391%83%67%60%
43Qwen3-4Bcohere-fastFREE/PAID0.77391%83%51%67%
44Octen-4BrawFREE/FREE0.76887%83%67%60%
45Octen-8Bbge-m3FREE/FREE0.76887%83%62%67%
46Qwen3-0.6Bjina-v3FREE/FREE0.76887%75%67%47%
47Octen-4Bcohere-fastFREE/PAID0.76791%83%67%47%
48Octen-8Bqwen3FREE/FREE0.76487%83%70%73%
49Qwen3-8Bqwen3FREE/FREE0.76187%83%62%60%
50Nemotron-8bcohere-4-proFREE/PAID0.75491%83%54%60%
51Octen-4Bjina-v3FREE/FREE0.75491%67%71%73%
52Octen-8Bjina-v3FREE/FREE0.75483%83%67%73%
53E5-base-v2zerank-2FREE/FREE0.73880%80%80%33%
54E5-large-v2cohere-4-fastFREE/PAID0.73580%70%60%60%
55text-emb-3-smallrawPAID/FREE0.73385%70%80%53%
56Octen-4Bqwen3FREE/FREE0.72187%83%73%67%
57Qwen3-4Bjina-v3FREE/FREE0.72187%75%64%67%
58Nemotron-8brawFREE/FREE0.71987%75%54%60%
59E5-base-v2cohere-4-fastFREE/PAID0.71885%80%80%53%
60KaLM-12Bjina-v3FREE/FREE0.71787%83%59%60%
61Qwen3-4Bbge-m3FREE/FREE0.71791%83%48%67%
62Nemotron-8bbge-m3FREE/FREE0.71491%83%73%53%
63Qwen3-4Bqwen3FREE/FREE0.71487%83%40%67%
64E5-base-v2cohere-4-proFREE/PAID0.71385%80%80%47%
65Qwen3-4BrawFREE/FREE0.71287%83%67%60%
66Qwen3-0.6Bqwen3FREE/FREE0.71078%83%64%40%
67E5-large-v2nemotron-rrFREE/FREE0.70480%70%70%60%
68E5-large-v2zerank-2FREE/FREE0.69880%70%70%53%
69Harrier-27brawFREE/FREE0.69787%83%72%67%
70E5-large-v2cohere-4-proFREE/PAID0.69580%70%60%60%
71Jina-v5jina-v3FREE/FREE0.69391%67%67%60%
72Jina-v5qwen3FREE/FREE0.69287%83%62%53%
73Harrier-27bcohere-4-fastFREE/PAID0.68391%83%76%60%
74KaLM-12Bcohere-4-proFREE/PAID0.67887%83%68%47%
75Harrier-27bbge-m3FREE/FREE0.67474%75%50%47%
76Harrier-27bcohere-fastFREE/PAID0.67283%83%65%53%
77Nemotron-8bcohere-4-fastFREE/PAID0.67087%83%56%53%
78E5-base-v2nemotron-rrFREE/FREE0.66485%80%70%60%
79Nemotron-8bjina-v3FREE/FREE0.66187%83%61%73%
80KaLM-12Bcohere-fastFREE/PAID0.65678%75%77%40%
81KaLM-12Bcohere-4-fastFREE/PAID0.64587%83%71%60%
82Nemotron-8bcohere-fastFREE/PAID0.64487%75%70%53%
83Harrier-27bqwen3FREE/FREE0.63783%83%70%53%
84E5-large-v2rawFREE/FREE0.61770%60%60%53%
85KaLM-12Bqwen3FREE/FREE0.60578%75%62%53%
86KaLM-12Bbge-m3FREE/FREE0.60170%58%68%53%
87KaLM-12BrawFREE/FREE0.56565%58%49%27%
88Nemotron-8bqwen3FREE/FREE0.49883%75%54%60%
made with 🍵 by sorcerai·@ariapramesi