MiniLM Encoder + CJK-Only Queries on MOOC-CS: Hierarchical R@10 Rises from 23.1 to 26.5 with Flat Dense at 21.7
On MOOC-CS with the English MiniLM dense encoder held fixed, switching the query side from English templates to CJK-only concept-name queries raises flat-dense Recall@10 from 16.0 [11.5, 20.9] to 21.7 [17.0, 27.0], the hierarchical baseline from 23.1 [17.2, 29.3] to 26.5 [20.1, 33.7], and Adaptive (heuristic) from 23.1 to 26.4. This row isolates the effect of query cleaning alone under an unchanged English-only encoder.
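The gains in the summary can be checked with simple arithmetic. The following is a minimal sketch that uses only the numbers reported above; the dictionary keys, function names, and variable names are illustrative assumptions, not identifiers from the underlying evaluation code. The summary does not state how the bracketed intervals were computed, so the overlap check is a rough reading aid and not a significance test.

```python
# Recall@10 values (percent) and bracketed intervals copied from the summary.
# All names here are hypothetical; only the numbers come from the source.
results = {
    "flat_dense": {
        "template": (16.0, (11.5, 20.9)),
        "cjk_only": (21.7, (17.0, 27.0)),
    },
    "hierarchical": {
        "template": (23.1, (17.2, 29.3)),
        "cjk_only": (26.5, (20.1, 33.7)),
    },
    # No intervals are reported for the Adaptive (heuristic) row.
    "adaptive_heuristic": {
        "template": (23.1, None),
        "cjk_only": (26.4, None),
    },
}


def absolute_gain(row):
    """Absolute change in Recall@10 (percentage points), rounded to 0.1."""
    return round(row["cjk_only"][0] - row["template"][0], 1)


def intervals_overlap(a, b):
    """True if closed intervals a=(lo, hi) and b=(lo, hi) share any point."""
    return a[0] <= b[1] and b[0] <= a[1]


gains = {name: absolute_gain(row) for name, row in results.items()}

# Overlap is only computed where both intervals are reported.
overlaps = {
    name: intervals_overlap(row["template"][1], row["cjk_only"][1])
    for name, row in results.items()
    if row["template"][1] is not None and row["cjk_only"][1] is not None
}

for name, gain in gains.items():
    print(f"{name}: +{gain} points")
```

Under these numbers, flat dense gains the most in absolute terms, while the hierarchical and Adaptive (heuristic) rows move by similar, smaller amounts. The reported intervals for the template and CJK-only conditions overlap in both rows that include them.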
Tags
Science
Auditable Strict-Parity Evaluation of Prerequisite-Graph Retrieval for RAG under Leakage Controls
Related
Template Stripping on MOOC-CS Raises Hierarchical R@10 from 23.1 to 26.5 (MiniLM Encoder)
Multilingual Encoder Alone Does Not Improve MOOC-CS Recall (Hierarchical R@10 = 22.3 vs 23.1)
Multilingual Encoder + CJK-Only Queries Jumps MOOC-CS Hierarchical R@10 to 68.1
Graph Effect on MOOC-CS Is Conditional on Dense Seed Pool Quality