ENGAGE run · cb5e74d4b275

Started 2026-05-03 04:46 UTC
$0.8331
Total cost
9
API calls
153,946
Tokens in
5%
Cache hit
Steps in this run
Step Calls Tokens in Cache hit Cost
ranking 2 127,153
3%
$0.76464
response generation 3 6,711
0%
$0.03315
learning engine pattern analysis 1 13,556
0%
$0.01504
haiku prescreen 2 10,158
48%
$0.01154
learning engine self eval 1 4,618
0%
$0.00869
All 9 API calls — tap to expand
1
haiku prescreen Haiku batch 14
04:46:55
$0.00941 5772ms
3,141
Tokens in (billed)
0
Cache read tokens
4,894
Cache written
501
Tokens out
0%
Cache hit rate
$0.009411
Est. cost (USD)
Result preview
```json [ { "post_index": 3, "cluster_ids": [1, 23], "claim": "AI second opinions lack rigorous prospective trials in real clinical workflows", "argument_type": "question_or_critique", "stance": "challenges_status_quo", "hyde_excerpt": "Clinical validation of AI diagnostic assistance remains fundamentally incomplete. While observational studies suggest potential utility,
2
ranking Sonnet batch 14
04:47:05
$0.47063 9446ms
84,065
Tokens in (billed)
0
Cache read tokens
56,565
Cache written
421
Tokens out
0%
Cache hit rate
$0.470629
Est. cost (USD)
Result preview
```json [ { "post_index": 7, "matched_article_id": 253, "match_confidence": 88, "match_reason": "The tweet argues that using outdated AI models instead of current SOTA constitutes clinical malpractice — directly engaging the article's central thesis that AI diagnostic superiority creates legal duty for physicians to adopt available AI tools, with the tweet explicitly claiming 'no
3
response generation Sonnet
04:47:12
$0.01056 7653ms
2,015
Tokens in (billed)
0
Cache read tokens
0
Cache written
301
Tokens out
0%
Cache hit rate
$0.010560
Est. cost (USD)
Result preview
The malpractice argument actually gets sharper when you follow the FDA approval mechanism. Once an adaptive AI algorithm clears FDA review, it doesn't stay optional for long. That regulatory stamp is precisely what courts and expert witnesses use to establish what's "standard in the field," and that's the moment physician non-adoption stops being conservative judgment and starts looking like a bre
4
response generation Sonnet
04:47:21
$0.01213 9184ms
2,210
Tokens in (billed)
0
Cache read tokens
0
Cache written
367
Tokens out
0%
Cache hit rate
$0.012135
Est. cost (USD)
Result preview
The prospective trial gap is real, but there's a subtler problem that tends to get skipped in this conversation: even when rigorous trials do exist, the findings don't always point where people expect them to. The DeepSeek-R1 RCT I wrote about recently randomized 32 critical care residents across six hospitals on diagnostically challenging cases. The AI alone hit 60 percent top-1 accuracy, reside
5
response generation Sonnet
04:47:27
$0.01046 5280ms
2,486
Tokens in (billed)
0
Cache read tokens
0
Cache written
200
Tokens out
0%
Cache hit rate
$0.010458
Est. cost (USD)
Result preview
The question this raises for anyone paying for these drugs at scale: if the clinical benefit timeline is months to years, but only 1-in-12 patients remain on therapy after three years, who actually captures the return on that investment? That persistence gap is the center of gravity for every employer coverage decision right now. And 59% of large employers are already reporting higher-than-expect
6
haiku prescreen Haiku batch 3
04:48:14
$0.00213 555ms
2,123
Tokens in (billed)
4,894
Cache read tokens
0
Cache written
9
Tokens out
70%
Cache hit rate
$0.002126
Est. cost (USD)
Result preview
```json [] ```
7
ranking Sonnet batch 3
04:48:17
$0.29401 2340ms
39,732
Tokens in (billed)
3,356
Cache read tokens
46,333
Cache written
4
Tokens out
8%
Cache hit rate
$0.294012
Est. cost (USD)
Result preview
[]
8
learning engine self eval Haiku
04:48:26
$0.00869 8463ms
4,618
Tokens in (billed)
0
Cache read tokens
0
Cache written
1,248
Tokens out
0%
Cache hit rate
$0.008686
Est. cost (USD)
Result preview
```json [ {"post_index": 0, "prediction": "reject", "confidence": 95, "reason": "Off-topic (military/drones), not healthcare-related"}, {"post_index": 1, "prediction": "reject", "confidence": 95, "reason": "Off-topic (sports/horses), not healthcare-related"}, {"post_index": 2, "prediction": "r
9
learning engine pattern analysis Haiku
04:48:38
$0.01504 9987ms
13,556
Tokens in (billed)
0
Cache read tokens
0
Cache written
1,049
Tokens out
0%
Cache hit rate
$0.015041
Est. cost (USD)
Result preview
```json [ { "category": "ai_safety_vulnerability_incident_tangential", "summary": "Posts about AI agent security breaches, hacks, or safety vulnerabilities without healthcare system application context.", "exclusion_rule": "Exclude posts that report AI safety incidents, security vulner