@sabik @futurebird @bri_seven this is exactly what they do, and it’s surprisingly effective because of the feedback loop. Unlike the pure LLM output, it’s now closer to classic evolutionary design with a generative component plus a fitness component, and can iterate until it produces a working program. Of course this assumes the test is described correctly, and it only works for programs that can be tested that way, but when it works it’s impressive.

alec@perkins.pub
@alec@perkins.pub
A forum for discussing and organizing recreational softball and baseball games and leagues in the greater Halifax area.