AI deleted my most tests, and said "All Tests Pass"

aautobe about 9 hours ago 2 commentsRead Article on typia.io

DE version is available. Content is displayed in original English for accuracy.

⚡ Community Insights

Discussion Sentiment

33% Positive

Analyzed from 90 words in the discussion.

Discussion (2 Comments)Read Original on HackerNews

benchwright•about 8 hours ago

Tends to be a problem. I've tried to mitigate these problems by using either external harnesses (aka GitHub actions that are "fixed" based on known-good) or by using n-number of witness agents (e.g. Kimi/Qwen/whatever <=> Claude/OpenAI/Google). Generally sucks more time and energy (and now token/$).

that being said, I still have a "fix the code, not the test" line somewhere in here...

cyanydeez•about 2 hours ago

believing that oneshot prompt is enough specification is pretty delusional.

I keep seeing people talk about the power of these SOTA models, yet keep reading the types of prompts that make no sense to anyone who understands the ludicrous number of decisions that would need to be made.