Run Detail · 37eef4d73774

$0.8661

Total cost

10

API calls

165,146

Tokens in (billed)

5%

Cache hit

Steps in this run

Step	Calls	Tokens in (billed + cache read)	Cache hit	Cost
ranking	2	132,044	3%	$0.78051
response generation	4	9,626	0%	$0.04533
haiku prescreen	2	12,900	38%	$0.01618
learning engine pattern analysis	1	13,398	0%	$0.01467
learning engine self eval	1	5,408	0%	$0.00940
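
The step totals above are consistent with the per-call figures listed further down if a step's "Tokens in" counts billed input tokens plus cache-read tokens, and "Cache hit" is the cache-read share of that sum. This is inferred from the numbers in this run, not from a documented formula, and the function name `step_summary` is hypothetical. A minimal Python sketch for the ranking step:

```python
def step_summary(calls):
    """Aggregate per-call token counts into step-level totals.

    Assumed formula (inferred from this run's numbers, not documented):
    tokens_in = billed input + cache reads; hit rate = cache reads / tokens_in.
    """
    billed = sum(c["billed_in"] for c in calls)
    cache_read = sum(c["cache_read"] for c in calls)
    tokens_in = billed + cache_read
    hit_rate = cache_read / tokens_in if tokens_in else 0.0
    return tokens_in, hit_rate


# The two ranking calls from this run (calls 2 and 8).
ranking_calls = [
    {"billed_in": 89_640, "cache_read": 0},
    {"billed_in": 39_037, "cache_read": 3_367},
]

tokens_in, hit_rate = step_summary(ranking_calls)
print(f"{tokens_in:,} tokens in, {hit_rate:.0%} cache hit")
# prints: 132,044 tokens in, 3% cache hit
```

The same calculation on the two haiku prescreen calls (5,952 and 2,085 billed, 4,863 cache read) gives 12,900 tokens in and a 38% hit rate, matching the table row.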

All 10 API calls

1

haiku prescreen Haiku batch 33

21:41:32

$0.01409 12913ms

5,952

Tokens in (billed)

0

Cache read tokens

4,863

Cache written

1,116

Tokens out

0%

Cache hit rate

$0.014089

Est. cost (USD)

Result preview

```json [ { "post_index": 0, "cluster_ids": [24], "claim": "J-1 visa delays force U.S.-trained international physicians to leave healthcare system", "argument_type": "empirical_claim", "stance": "challenges_status_quo", "hyde_excerpt": "The J-1 visa waiver program delays are creating an unintended physician supply shock in safety-net and rural healthcare delivery. As hund

2

ranking Sonnet batch 33

21:41:44

$0.48860 11997ms

89,640

Tokens in (billed)

0

Cache read tokens

56,534

Cache written

512

Tokens out

0%

Cache hit rate

$0.488602

Est. cost (USD)

Result preview

```json [ { "post_index": 17, "matched_article_id": 453, "match_confidence": 87, "match_reason": "The tweet argues that AI drug discovery models overfit on positive binding data due to lack of published null results — directly addressing the article's thesis that AI has compressed preclinical discovery but exposed downstream evidence infrastructure failures, including training da

3

response generation Sonnet

21:41:51

$0.01099 6893ms

2,324

Tokens in (billed)

0

Cache read tokens

0

Cache written

268

Tokens out

0%

Cache hit rate

$0.010992

Est. cost (USD)

Result preview

Roughly one in five real-world oncology patients would not qualify for the phase 3 trials that generated the binding and efficacy data these models are trained on, which means the selection problem runs deeper than just missing negatives. The training data gap you're describing (molecules that failed to bind) has a structural cousin in clinical translation: the populations that generated positive

4

response generation Sonnet

21:41:57

$0.01234 6156ms

2,687

Tokens in (billed)

0

Cache read tokens

0

Cache written

285

Tokens out

0%

Cache hit rate

$0.012336

Est. cost (USD)

Result preview

The diagnostic accuracy finding is real and the research is solid. But there's a structural problem sitting right underneath it that this framing skips over entirely. When an LLM outperforms a physician and the physician follows that recommendation and the patient is harmed anyway, the physician gets sued. The LLM vendor does not. That's not hypothetical, that's the current contractual and regula

5

response generation Sonnet

21:42:03

$0.01005 5982ms

2,236

Tokens in (billed)

0

Cache read tokens

0

Cache written

223

Tokens out

0%

Cache hit rate

$0.010053

Est. cost (USD)

Result preview

The build cost was prohibitive, so you bought a vendor platform instead. That vendor's moat was never the idea, it was that you couldn't afford to replicate it. That's the exact dynamic I traced through healthcare in https://www.onhealthcare.tech/p/the-free-lunch-is-over-except-now?utm_source=x&utm_medium=reply&utm_content=2050684160151617603&utm_campaign=the-free-lunch-is-over-except-now, where a

6

response generation Sonnet

21:42:12

$0.01195 8496ms

2,379

Tokens in (billed)

0

Cache read tokens

0

Cache written

321

Tokens out

0%

Cache hit rate

$0.011952

Est. cost (USD)

Result preview

The 2013 ACC/AHA guideline shift alone added an estimated 12.8 million newly statin-eligible Americans overnight, and the calculator used to justify it was later shown to overestimate cardiovascular risk by 75-150% in external validation cohorts. But the threshold-moving dynamic you're describing is actually the mechanism that makes cost-plus drug pricing so disruptive as a business model. When g

7

haiku prescreen Haiku batch 3

21:42:57

$0.00209 715ms

2,085

Tokens in (billed)

4,863

Cache read tokens

0

Cache written

9

Tokens out

70%

Cache hit rate

$0.002093

Est. cost (USD)

Result preview

```json [] ```

8

ranking Sonnet batch 3

21:42:59

$0.29190 2230ms

39,037

Tokens in (billed)

3,367

Cache read tokens

46,306

Cache written

9

Tokens out

8%

Cache hit rate

$0.291904

Est. cost (USD)

Result preview

```json [] ```
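
The estimated cost shown for each call can be reproduced from its four token counts under a set of per-million-token rates. The rates below are an assumption: they are not stated anywhere on this page and were chosen because they match the displayed figures for this run. The `RATES` table and the `estimate_cost` name are hypothetical. A sketch:

```python
# Assumed USD rates per million tokens. NOT taken from the page; chosen because
# they reproduce the displayed estimates for this run.
RATES = {
    "sonnet": {"in": 3.00, "cache_read": 0.30, "cache_write": 3.75, "out": 15.00},
    "haiku": {"in": 0.80, "cache_read": 0.08, "cache_write": 1.00, "out": 4.00},
}


def estimate_cost(model, billed_in, cache_read, cache_write, out):
    """Return the estimated USD cost of one call under the assumed rates."""
    r = RATES[model]
    total = (
        billed_in * r["in"]
        + cache_read * r["cache_read"]
        + cache_write * r["cache_write"]
        + out * r["out"]
    )
    return total / 1_000_000


# Call 8 (ranking, Sonnet) and call 7 (haiku prescreen) from this run.
print(round(estimate_cost("sonnet", 39_037, 3_367, 46_306, 9), 6))
print(round(estimate_cost("haiku", 2_085, 4_863, 0, 9), 6))
```

Under these assumed rates the two calls come to $0.291904 and $0.002093, the same values the page displays. In call 8, the 46,306 cache-written tokens account for roughly $0.17 of the $0.29, even though the call returned an empty result.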

9

learning engine self eval Haiku

21:43:09

$0.00940 9732ms

5,408

Tokens in (billed)

0

Cache read tokens

0

Cache written

1,268

Tokens out

0%

Cache hit rate

$0.009398

Est. cost (USD)

Result preview

```json [ {"post_index": 0, "prediction": "reject", "confidence": 92, "reason": "Non-healthcare content, vague praise without substance"}, {"post_index": 1, "prediction": "reject", "confidence": 95, "reason": "Airline link, not healthcare-related"}, {"post_index": 2, "prediction": "reject", "c

10

learning engine pattern analysis Haiku

21:43:20

$0.01467 9236ms

13,398

Tokens in (billed)

0

Cache read tokens

0

Cache written

989

Tokens out

0%

Cache hit rate

$0.014674

Est. cost (USD)

Result preview

```json [ { "category": "ai_safety_vulnerability_incident_tangential", "summary": "Posts about AI safety incidents, security breaches, or vulnerability exploits that lack healthcare system context or application.", "exclusion_rule": "Exclude posts describing AI model vulnerabilities, s