Run Detail · cb5e74d4b275

Total cost: $0.8331
API calls: 9
Tokens in: 153,946
Cache hit: 5%

Steps in this run

Step	Calls	Tokens in	Cache hit	Cost
ranking	2	127,153	3%	$0.76464
response generation	3	6,711	0%	$0.03315
learning engine pattern analysis	1	13,556	0%	$0.01504
haiku prescreen	2	10,158	48%	$0.01154
learning engine self eval	1	4,618	0%	$0.00869
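
The per-step rows above can be cross-checked against the per-call figures further down the page. The sketch below is a minimal reconciliation, not the dashboard's actual code. It assumes that a step's "Tokens in" is billed input plus cache-read tokens and that "Cache hit" is cache-read tokens divided by that sum; both assumptions are inferred from the numbers on this page. The names `CALLS`, `hit_rate_pct`, and `summarize` are hypothetical.

```python
from collections import defaultdict

# Per-call figures transcribed from this page:
# (step, billed input tokens, cache read tokens, estimated cost in USD)
CALLS = [
    ("haiku prescreen", 3141, 0, 0.009411),
    ("ranking", 84065, 0, 0.470629),
    ("response generation", 2015, 0, 0.010560),
    ("response generation", 2210, 0, 0.012135),
    ("response generation", 2486, 0, 0.010458),
    ("haiku prescreen", 2123, 4894, 0.002126),
    ("ranking", 39732, 3356, 0.294012),
    ("learning engine self eval", 4618, 0, 0.008686),
    ("learning engine pattern analysis", 13556, 0, 0.015041),
]


def hit_rate_pct(billed, cache_read):
    """Assumed definition: cache reads as a share of all input tokens, in whole percent."""
    total = billed + cache_read
    return round(100 * cache_read / total) if total else 0


def summarize(calls):
    """Group calls by step and rebuild the columns of the step table."""
    acc = defaultdict(lambda: {"calls": 0, "billed": 0, "cache_read": 0, "cost": 0.0})
    for step, billed, cache_read, cost in calls:
        row = acc[step]
        row["calls"] += 1
        row["billed"] += billed
        row["cache_read"] += cache_read
        row["cost"] += cost
    return {
        step: {
            "calls": row["calls"],
            "tokens_in": row["billed"] + row["cache_read"],
            "cache_hit_pct": hit_rate_pct(row["billed"], row["cache_read"]),
            "cost": round(row["cost"], 5),
        }
        for step, row in acc.items()
    }


if __name__ == "__main__":
    for step, row in summarize(CALLS).items():
        print(step, row)
    print("billed input total:", sum(c[1] for c in CALLS))
    print("total cost:", round(sum(c[3] for c in CALLS), 4))
```

Under these assumptions the step rows reproduce the table, and the sum of billed input tokens alone reproduces the 153,946 shown in the run summary, which suggests the summary figure excludes cache reads while the step table includes them.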

All 9 API calls

1 · haiku prescreen · Haiku · batch 14
04:46:55 · $0.00941 · 5772 ms

Tokens in (billed): 3,141
Cache read tokens: 0
Cache written: 4,894
Tokens out: 501
Cache hit rate: 0%
Est. cost (USD): $0.009411

Result preview

```json [ { "post_index": 3, "cluster_ids": [1, 23], "claim": "AI second opinions lack rigorous prospective trials in real clinical workflows", "argument_type": "question_or_critique", "stance": "challenges_status_quo", "hyde_excerpt": "Clinical validation of AI diagnostic assistance remains fundamentally incomplete. While observational studies suggest potential utility,

2 · ranking · Sonnet · batch 14
04:47:05 · $0.47063 · 9446 ms

Tokens in (billed): 84,065
Cache read tokens: 0
Cache written: 56,565
Tokens out: 421
Cache hit rate: 0%
Est. cost (USD): $0.470629

Result preview

```json [ { "post_index": 7, "matched_article_id": 253, "match_confidence": 88, "match_reason": "The tweet argues that using outdated AI models instead of current SOTA constitutes clinical malpractice — directly engaging the article's central thesis that AI diagnostic superiority creates legal duty for physicians to adopt available AI tools, with the tweet explicitly claiming 'no

3 · response generation · Sonnet
04:47:12 · $0.01056 · 7653 ms

Tokens in (billed): 2,015
Cache read tokens: 0
Cache written: 0
Tokens out: 301
Cache hit rate: 0%
Est. cost (USD): $0.010560

Result preview

The malpractice argument actually gets sharper when you follow the FDA approval mechanism. Once an adaptive AI algorithm clears FDA review, it doesn't stay optional for long. That regulatory stamp is precisely what courts and expert witnesses use to establish what's "standard in the field," and that's the moment physician non-adoption stops being conservative judgment and starts looking like a bre

4 · response generation · Sonnet
04:47:21 · $0.01213 · 9184 ms

Tokens in (billed): 2,210
Cache read tokens: 0
Cache written: 0
Tokens out: 367
Cache hit rate: 0%
Est. cost (USD): $0.012135

Result preview

The prospective trial gap is real, but there's a subtler problem that tends to get skipped in this conversation: even when rigorous trials do exist, the findings don't always point where people expect them to. The DeepSeek-R1 RCT I wrote about recently randomized 32 critical care residents across six hospitals on diagnostically challenging cases. The AI alone hit 60 percent top-1 accuracy, reside

5 · response generation · Sonnet
04:47:27 · $0.01046 · 5280 ms

Tokens in (billed): 2,486
Cache read tokens: 0
Cache written: 0
Tokens out: 200
Cache hit rate: 0%
Est. cost (USD): $0.010458

Result preview

The question this raises for anyone paying for these drugs at scale: if the clinical benefit timeline is months to years, but only 1-in-12 patients remain on therapy after three years, who actually captures the return on that investment? That persistence gap is the center of gravity for every employer coverage decision right now. And 59% of large employers are already reporting higher-than-expect

6 · haiku prescreen · Haiku · batch 3
04:48:14 · $0.00213 · 555 ms

Tokens in (billed): 2,123
Cache read tokens: 4,894
Cache written: 0
Tokens out: 9
Cache hit rate: 70%
Est. cost (USD): $0.002126

Result preview

```json [] ```

7 · ranking · Sonnet · batch 3
04:48:17 · $0.29401 · 2340 ms

Tokens in (billed): 39,732
Cache read tokens: 3,356
Cache written: 46,333
Tokens out: 4
Cache hit rate: 8%
Est. cost (USD): $0.294012

Result preview

[]
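
Call 7 above is the only call with non-zero values in all four token fields, so it makes a convenient worked example of how an estimated cost might be assembled. The sketch below is illustrative only. The per-million-token rates in `RATES` are assumptions, not taken from this page; they were chosen because they reproduce the "Est. cost (USD)" figures shown for the calls here. The name `estimate_cost` is hypothetical.

```python
# Hypothetical USD rates per million tokens. These are assumptions inferred
# from the estimates on this page, not values the page itself states.
RATES = {
    "sonnet": {"input": 3.00, "cache_read": 0.30, "cache_write": 3.75, "output": 15.00},
    "haiku": {"input": 0.80, "cache_read": 0.08, "cache_write": 1.00, "output": 4.00},
}


def estimate_cost(model, billed_in, cache_read, cache_written, tokens_out):
    """Sum each token count times its assumed rate, then scale to USD."""
    r = RATES[model]
    per_million = (
        billed_in * r["input"]
        + cache_read * r["cache_read"]
        + cache_written * r["cache_write"]
        + tokens_out * r["output"]
    )
    return round(per_million / 1_000_000, 6)


if __name__ == "__main__":
    # Call 7: ranking, Sonnet, batch 3
    print(estimate_cost("sonnet", 39732, 3356, 46333, 4))
    # Call 6: haiku prescreen, Haiku, batch 3
    print(estimate_cost("haiku", 2123, 4894, 0, 9))
```

With these assumed rates, most of call 7's cost comes from the 46,333 tokens written to cache and the 39,732 billed input tokens, while the 4 output tokens contribute almost nothing.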

8 · learning engine self eval · Haiku
04:48:26 · $0.00869 · 8463 ms

Tokens in (billed): 4,618
Cache read tokens: 0
Cache written: 0
Tokens out: 1,248
Cache hit rate: 0%
Est. cost (USD): $0.008686

Result preview

```json [ {"post_index": 0, "prediction": "reject", "confidence": 95, "reason": "Off-topic (military/drones), not healthcare-related"}, {"post_index": 1, "prediction": "reject", "confidence": 95, "reason": "Off-topic (sports/horses), not healthcare-related"}, {"post_index": 2, "prediction": "r

9 · learning engine pattern analysis · Haiku
04:48:38 · $0.01504 · 9987 ms

Tokens in (billed): 13,556
Cache read tokens: 0
Cache written: 0
Tokens out: 1,049
Cache hit rate: 0%
Est. cost (USD): $0.015041

Result preview

```json [ { "category": "ai_safety_vulnerability_incident_tangential", "summary": "Posts about AI agent security breaches, hacks, or safety vulnerabilities without healthcare system application context.", "exclusion_rule": "Exclude posts that report AI safety incidents, security vulner