Default OPENAI_REASONING_EFFORT to 'low' (not 'minimal')

Bentlybro · Bentlybro · commit 4935119ac28d · 2026-04-22T14:03:38.000+01:00
'minimal' is only valid on older gpt-5 / o-series models and is rejected
by gpt-5.4 with a 400 ("Supported values are: 'none', 'low', 'medium',
'high', 'xhigh'"). Our fail-safe silently approved every request when the
API errored, which presented as "moderation is fast but broken" in real
tests — scam / phishing / misinfo content was getting through.

'low' is the lowest common value accepted by every reasoning model in the
family so the default works regardless of which model you point the
OPENAI_CHAT_MODEL at. Empirically verified: 8/8 real-world classification
accuracy on gpt-5.4-nano with reasoning=low, vs 2/4 with reasoning=none
(false negatives on obvious scams).
diff --git a/config/config.py b/config/config.py
@@ -24,12 +24,17 @@ class Config:
     # AI moderator only returns small JSON responses (~100-200 tokens), so 500 is plenty
     OPENAI_MAX_OUTPUT_TOKENS = int(os.environ.get(
         'OPENAI_MAX_OUTPUT_TOKENS', '500'))
-    # Reasoning level for gpt-5 / o-series models. Valid values depend on the
-    # model; gpt-5.4 supports 'none' | 'low' | 'medium' | 'high' | 'xhigh';
-    # older gpt-5 / o-series accept 'minimal' | 'low' | 'medium' | 'high'.
-    # Lower = faster. Set to 'none' for pattern-matching workloads like
-    # content moderation where reasoning doesn't add signal.
-    OPENAI_REASONING_EFFORT = os.environ.get('OPENAI_REASONING_EFFORT', 'minimal')
+    # Reasoning level for gpt-5 / o-series models. Valid values are model-
+    # specific:
+    #   gpt-5 / o-series:   minimal | low | medium | high
+    #   gpt-5.4 (nano etc): none    | low | medium | high | xhigh
+    # 'low' is the only value accepted across every reasoning model in the
+    # supported family, so it's a safe default. Override per-deployment in
+    # .env if you've picked a model and want a different point on the
+    # latency/quality curve. Note that 'none' on gpt-5.4 measurably degrades
+    # moderation accuracy on borderline content (false negatives on scams,
+    # phishing, misinformation) — verified empirically on this codebase.
+    OPENAI_REASONING_EFFORT = os.environ.get('OPENAI_REASONING_EFFORT', 'low')
     ADMIN_EMAIL = os.environ.get('ADMIN_EMAIL')
     ADMIN_PASSWORD = os.environ.get('ADMIN_PASSWORD')