autojanet/skills/taste-skill/research/laziness/findings/references.md
Zoë cfec11bb46
Some checks failed
ci/woodpecker/push/woodpecker Pipeline failed
fix: convert skill submodules to plain directories
stop-slop, taste-skill, terrashark had embedded .git dirs causing
Woodpecker clone to fail on submodule update.
2026-05-30 15:44:44 -07:00

1.5 KiB

References

Cited Studies

  • EmotionPrompt (Microsoft Research) — Demonstrates that emotional and stakes-based prompt framing mathematically improves LLM reasoning quality and output length. Documents the +45% improvement from financial framing and +115% from combined stimuli.

  • LazyBench — Proves that frontier models (Gemini 1.5 Pro, GPT-4o) actively select cognitive shortcuts and fail tasks they are capable of solving when the perceived effort exceeds internal thresholds.

  • Compounding Error Avoidance — Research demonstrating that models truncate outputs as a risk mitigation strategy, preferring shorter responses to reduce the surface area for factual errors on long-form tasks.

  • Seasonal Behavior Analysis (Winter Break Hypothesis) — Statistical analysis confirming that LLMs internalize seasonal work patterns from training data, producing measurably shorter outputs during periods corresponding to human holiday seasons.

  • 2025 Controlled Laziness Experiments — Three-part academic study (December 2025) confirming that output truncation is a behavioral artifact of alignment training, not a failure of context processing or model capability.

Further Reading

  • Google Gemini API documentation on thinking_level parameter configuration
  • Anthropic MCP (Model Context Protocol) specification and integration guides
  • OpenAI API reference for temperature and Top-p parameter tuning
  • YAML front-matter specification for SKILL.md lazy-loading architecture