view article Article Quantum Cryptanalysis on Real Hardware: Pushing Symmetric-Structure Key Recovery Beyond the Published Frontier FINAL-Bench β’ about 11 hours ago β’ 12
VKAE Accelerated Collection Fastest single-GPU serving of open models via VKAE. Live board: hf.co/spaces/VIDraft/vkae. Each = card + Docker. β’ 2 items β’ Updated 2 days ago β’ 13
view article Article Does Your LLM Know *When It's About to Be Wrong*? ginigen-ai β’ 5 days ago β’ 20
Metacognition Adapters Collection Per-model metacognition adapters from VIDRAFT Darwin/Chimera platform + AETHER metacognition-emergence technology. β’ 11 items β’ Updated 5 days ago β’ 22
view article Article Chitos: From Detection to Proof β An Autonomous Security AI That Actually Exploits FINAL-Bench β’ 6 days ago β’ 19
view article Article FINAL-Bench Quantum: An Open, Neutral Benchmark for Quantum-Computing Methods FINAL-Bench β’ 21 days ago β’ 17
view article Article Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step FINAL-Bench β’ May 15 β’ 18
Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning Paper β’ 2605.14386 β’ Published May 14 β’ 67
view article Article Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain β It Started Showing Emotion FINAL-Bench β’ Apr 15 β’ 13
view article Article Darwin V6: Diagnostic-Guided Evolutionary Model Merging FINAL-Bench β’ Apr 8 β’ 12
view article Article "The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge" FINAL-Bench β’ Mar 31 β’ 15
view article Article Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models FINAL-Bench β’ Mar 29 β’ 13
view article Article ποΈ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do FINAL-Bench β’ Mar 10 β’ 38
view article Article MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning FINAL-Bench β’ Mar 9 β’ 16
view article Article Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework FINAL-Bench β’ Mar 8 β’ 12
view article Article Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism? FINAL-Bench β’ Feb 24 β’ 17
view article Article FINAL Bench: The Real Bottleneck to AGI Is Self-Correction FINAL-Bench β’ Feb 21 β’ 20