FEATUREDarXiv · cs.LG· atomEN04:00 · 05·23
→SCI-Defense: Defending Manipulation Attacks from Generative Engine Optimization
SCI-Defense combines PPL, SIS, and ICD, and achieves 1.000 precision and 0.000 FPR on 600 Amazon product descriptions, with recall of 1.000, 0.952, and 0.830 against String, Reasoning, and Review attacks respectively.
#Safety#RAG#Benchmarking#Amazon
why featured
Single arXiv paper, not a model or product event; HKR-H/K/R pass because it names GEO manipulation, gives test metrics on 600 Amazon descriptions, and hits AI-search/RAG trust concerns. Featured threshold, not P1.
editor take
SCI-Defense posts perfect Amazon numbers, but 600 product descriptions with 0.000 FPR smells more like a lab fence than production GEO defense.
sharp
SCI-Defense moves GEO defense forward, but I would not read 1.000 precision as deployment-ready. The hard number is narrow: 600 Amazon product descriptions across 6 categories, with recall of 1.000, 0.952, and 0.830 on String, Reasoning, and Review attacks, plus 0.000 FPR. On 600 MS MARCO web passages, Review-attack recall drops near zero because those passages lack the persuasion signals SIS is built to catch.
The useful claim is the failure mode: PPL-only filters, SafetyClf classifiers, and paraphrasing show zero recall on semantic manipulation. That tracks with what search teams are seeing: GEO is not classic content safety; it is relevance gaming written in natural language. SCI-Defense looks like a product-page rule stack today, not a general retrieval shield.
HKR breakdown
hook ✓knowledge ✓resonance ✓