papers · 2026-05-03

▸ 9 papers · updated 3m ago

May 2026

MTWTFSS

1150 2 39 4190 5295 6173 7203 8283 9 10 11296 12541 13284 14248 15209 16 17 18224 19490 20273 21240 2223 23185 24 25200 26431 27258 28251 29257 30 31

June 2026

MTWTFSS

1261 2473 3215 4239 57 6 7 8173 9377101112131415161718192021222324252627282930

2026-05-03 · Sun

23:16

36d ago

FEATUREDHuggingFace Papers (takara mirror)· rssEN23:16 · 05·03

→Model Spec Midtraining: Improving How Alignment Training Generalizes

The authors introduce Model Spec Midtraining, training models on synthetic Model Spec documents after pretraining and before alignment fine-tuning, and report that a self-preservation and goal-guarding spec cuts Qwen3-32B’s agentic misalignment rate from 54% to 7%, versus 14% for a deliberative alignment baseline.

#Alignment#Safety#Fine-tuning#Qwen

why featured

HKR-H/K/R all pass: the paper gives a concrete midtraining mechanism and a Qwen3-32B agentic misalignment drop from 54% to 7%. This is strong safety research, not a major model launch, so it fits the 78–84 band.

editor take

MSM front-loads the spec before SFT and cuts Qwen3-32B misalignment from 54% to 7%; that’s a sharper fix than more RLHF demos.

sharp

MSM lands because it admits demos do not teach the boundary of generalization. The authors insert synthetic Model Spec documents after pretraining and before alignment fine-tuning. The same cheese-preference SFT can generalize into pro-America values or affordability values, depending on the spec narrative. That is a clean probe: the model is not only copying behavior; it is anchoring behavior to a story. The Qwen3-32B number is hard to ignore: agentic misalignment drops from 54% to 7%, beating a deliberative alignment baseline at 14%. I still have doubts. Synthetic specs hand the value interpretation layer to whoever writes the documents. OpenAI’s Model Spec and Anthropic’s Constitution run into the same pressure: once the spec gets legalistic, hard cases become interpretation contests, not alignment wins.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

SCORE

H1·K1·R1

22:46

36d ago

FEATUREDHuggingFace Papers (takara mirror)· rssEN22:46 · 05·03

→EditPropBench: Measuring Factual Edit Propagation in Scientific Manuscripts

EditPropBench tests whether LLM editors propagate factual edits across dependent claims in synthetic ML/NLP manuscripts with controlled fact graphs, and five systems score 0.148-0.705 ERA on the hardest cases, where even the strongest misses about 30% of required cascade updates.

#Benchmarking#EditPropBench#arXiv#Research release

why featured

HKR-H/K/R all pass, but this is a single benchmark paper without a major-lab release or cross-source traction. Clear ERA numbers and setup put it at the lower featured band.

editor take

EditPropBench hits the ugly gap in LLM editing: sentence polish is cheap; maintaining the paper’s fact graph still fails hard.

sharp

EditPropBench puts LLM editing back where it hurts: factual maintenance, not prose cleanup. The test avoids easy “215 to 80” substitution and checks whether dependent phrases like “medium-scale” or “a few hundred items” get revised too. The authors also audited recent arXiv cs.CL benchmark and dataset papers and found fact-dependent qualitative claims in 37.2% of them. The ugly number is ERA 0.148-0.705 on the hardest cases, with the best system still missing about 30% of required cascade edits. That is brutal for scientific-writing copilots. One missed downstream claim can poison a results section. I don’t buy “automatic manuscript editing” as a serious claim unless the system carries sentence-level dependency graphs or runs a cascade checker after every factual edit.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

SCORE

H1·K1·R1

22:36

36d ago

HuggingFace Papers (takara mirror)· rssEN22:36 · 05·03

→Cripping AI: Reimagining AI Through Lived Disability Experiences

The paper proposes cripping AI as a framework and applies it to 3 cases: deafness and sign language AI, blindness and visual assistive AI, and stuttering and speech AI.

#Safety#Alignment#Multimodal#Research release

why featured

HKR-H/K/R all pass, but the post offers a framework and 3 cases without empirical results, model data, or reproducible tests. That keeps it in all, below the 72 featured line.

editor take

The paper uses 3 cases to attack ableist evals; accessibility as a patch leaves datasets, metrics, and product assumptions rotten.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

SCORE

H1·K1·R1

22:01

36d ago

HuggingFace Papers (takara mirror)· rssEN22:01 · 05·03

→Enhanced LLM Reasoning by Optimizing Reward Functions with Search-Driven Reinforcement Learning

The paper optimizes reward functions while holding Llama-3.2-3B-Instruct fixed, generating 50 candidates over five rounds and reaching F1 0.795 on GSM8K with the best ensemble.

#Reasoning#Alignment#Fine-tuning#Llama

why featured

HKR-K passes via concrete setup and GSM8K result; HKR-H and HKR-R are weak because this is a method paper with limited industry pull. Scored in the 60-71 research band.

editor take

Fixed Llama-3.2-3B hits 0.795 GSM8K F1 via reward search; random five-reward control at 0.047 makes this credible.

HKR breakdown

hook —knowledge ✓resonance —

→ open source

SCORE

H0·K1·R0

18:32

36d ago

HuggingFace Papers (takara mirror)· rssEN18:32 · 05·03

→Enhancing Judgment Document Generation via Agentic Legal Information Collection and Rubric-Guided Optimization

Judge-R1 uses a dynamic planning agent to retrieve statutes and precedents from multiple sources, then applies GRPO with a legal reward function to optimize judgment document generation; experiments use the JuDGE benchmark, but the post does not disclose exact scores.

#Agent#RAG#Reasoning#Judge-R1

why featured

HKR-K passes via a testable agentic RAG plus GRPO reward setup, but HKR-H is a dry academic title and HKR-R stays narrow to legal-AI builders. No hard exclusion; low-60s all-tier signal.

editor take

Judge-R1 adds agentic legal retrieval plus GRPO; no JuDGE scores are disclosed, so treat “significantly outperforms” as unproven.

HKR breakdown

hook —knowledge ✓resonance —

→ open source

SCORE

H0·K1·R0

17:47

36d ago

HuggingFace Papers (takara mirror)· rssEN17:47 · 05·03

→Phase-Aware Bounded-Loss Transport for Distributed Machine Learning Training

DBLP adjusts gradient loss tolerance by training phase and cuts end-to-end training time by 24.4% on average, with a 33.9% maximum reduction, while reaching up to 5.88x single-round communication latency speedups during microburst events versus the baseline.

#Fine-tuning#Inference-opt#DBLP#Research release

why featured

HKR-H/K/R pass, but the story is narrow distributed-training transport rather than a broad model or product release. Concrete speedup numbers keep it in all, below featured.

editor take

DBLP cuts training time 24.4% on average. But model scale, topology, and baseline are undisclosed, so 5.88x is not portable yet.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

SCORE

H1·K1·R1

17:29

36d ago

FEATUREDHuggingFace Papers (takara mirror)· rssEN17:29 · 05·03

→DexSim2Real: Foundation Model-Guided Sim-to-Real Transfer for Generalizable Dexterous Manipulation

DexSim2Real achieves a 78.2% average real-world success rate across six dexterous manipulation tasks, using FM-DR, TVCAP, and PSC to reduce the sim-to-real performance gap to 8.3% under blinded evaluation.

#Robotics#Vision#Agent#DexSim2Real

why featured

HKR-H/K/R pass, but this is a single robotics research item with no disclosed code, replication details, or production deployment. It fits the 72–77 featured-threshold band, not same-day must-write.

editor take

78.2% real-world success is solid, but this is not general robotics solved; FM-guided randomization is the part I’d try to reproduce.

sharp

DexSim2Real matters because it attacks sim-to-real tuning, not because it proves “foundation-model robotics.” The paper reports 78.2% average real-world success across six dexterous tasks and cuts the sim-to-real gap to 8.3%. The useful mechanism is FM-DR: a vision-language model acts as a visual realism critic, then CMA-ES searches simulator parameters. That is a cleaner idea than asking an LLM to write randomization rules, which is where DrEureka-style methods still feel brittle. I still don’t fully buy the headline number. The snippet says blinded evaluation and claims wins over DrEureka and DeXtreme, but it does not expose per-task, per-object, or perturbation breakdowns. Dexterous manipulation averages hide a lot; two forgiving contact tasks can make a policy look far more transferable than it is.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

SCORE

H1·K1·R1

16:45

36d ago

FEATUREDHuggingFace Papers (takara mirror)· rssEN16:45 · 05·03

→Flexi-LoRA Paper: Input-Adaptive Rank Efficient Finetuning Method

Flexi-LoRA adjusts LoRA ranks by input complexity during both training and inference, and the paper reports higher performance than static LoRA with fewer parameters across question answering, mathematical reasoning, and speech tasks; the post does not disclose the evaluated base models, parameter counts, or benchmark scores.

#Fine-tuning#Reasoning#Audio#Research release

why featured

HKR-K and HKR-R pass: the mechanism is relevant to fine-tuning practice and cost. HKR-H is weak, and missing model names, parameter counts, and scores keep it in the 60–71 research-signal band.

editor take

Flexi-LoRA moves rank selection to the input level, which is the right instinct; without baseline numbers, don’t crown it yet.

sharp

Both sources reuse the same arXiv 2605.01959 paper title, so the coverage is aligned through a paper-distribution chain, not independent validation. Flexi-LoRA makes a clean bet: static LoRA assigns one rank budget to every input, wasting capacity on easy cases and under-serving harder reasoning or speech samples. I like the direction, but the abstract is too victory-lap for the evidence shown here. It says Flexi-LoRA adjusts ranks during both training and inference, beats static LoRA with fewer parameters, and works across QA, math reasoning, and speech tasks. The disclosed text does not give model names, rank ranges, parameter savings, or benchmark scores. The credible hook is narrower: math reasoning shows higher dependency on rank dynamics than QA. This smells like MoE-style conditional compute pushed down into adapters; the failure mode is the complexity estimator misrouting chain-of-thought-heavy problems.

HKR breakdown

hook —knowledge ✓resonance ✓

→ open source

SCORE

H0·K1·R1

07:58

37d ago

FEATUREDHuggingFace Papers (takara mirror)· rssEN07:58 · 05·03

→Research proposes inference-time relevance propagation method to reduce multimodal LLM hallucinations

The paper proposes LIME, a training-free inference-time method that uses Layer-wise Relevance Propagation to score token contributions and update key-value representations without changing model parameters or adding training data; the snippet says it reduces hallucinations on vision and audio benchmarks, but does not disclose numeric results.

#Multimodal#Vision#Audio#Research release

why featured

HKR-H/K/R all pass, but the post gives no benchmark deltas and is a single paper summary. The inference-time KV mechanism clears featured, not the 78+ research tier.

editor take

Both sources trace to arXiv 2605.01766; LIME moves hallucination mitigation into KV-cache edits—clever, but no effect size means no victory lap.

sharp

Both sources carry the same paper title, and Hugging Face/Takara points back to arXiv 2605.01766, so this is a single-source research signal rather than independent confirmation. LIME’s claim is precise: use Layer-wise Relevance Propagation during decoding, then update key-value representations to increase reliance on vision or audio inputs, without parameter changes or new training data. I like the framing because it attacks a real MLLM failure mode: text priors overpower perceptual tokens at inference. That is sharper than another generic “alignment” fix. But the abstract only says hallucinations drop across multiple vision and audio benchmarks; it does not give model names, effect sizes, latency, or how many KV update steps are required. Against training-free methods like IVE, LIME lives or dies on overhead, not elegance.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

SCORE

H1·K1·R1

papers · 2026-05-03

more

feeds

admin