MoreRSS

site iconGeoffrey LittModify

Researcher explores malleable software and AI, with a PhD from MIT and work at Ink & Switch.
Please copy the RSS to your reader, or quickly subscribe to:

Inoreader Feedly Follow Feedbin Local Reader

Rss preview of twitter of Geoffrey Litt

So cool to see this resource that @schickling and @jessmartin have created! A lot of hard work and so much value to the community

2025-04-30 09:19:13

So cool to see this resource that @schickling and @jessmartin have created! A lot of hard work and so much value to the community



localfirst.fm Podcast: Announcing the Local-First Landscape

A comprehensive guide to help developers choose the right tools for building local-first applications.

Wow, o3 is a surprisingly good http://Mint.com replacement. Try sharing a pdf credit card statement and asking for a categorized breakdown, and then d...

2025-04-28 07:27:53

Wow, o3 is a surprisingly good http://Mint.com replacement.

Try sharing a pdf credit card statement and asking for a categorized breakdown, and then discuss opportunities to save money.

Really nice that it makes good guesses about what entries are referring to

o3 did a great job helping me debug some code today. the key is print debugging. here's my workflow: - ask the model to suggest a few theories for wha...

2025-04-25 10:35:59

o3 did a great job helping me debug some code today.

the key is print debugging. here's my workflow:

- ask the model to suggest a few theories for what might be wrong and then to add print statements that would provide useful evidence
- copy-paste the resulting logs back into the chat
- ask the model to provide an explanation for how the logs support the various theories.
- actually read its explanation and think about whether it makes any sense! (in this case today the explanation was great! see the example output in screenshot)
- only proceed with a fix once you're convinced of the explanation

in my experience this works much better than just having it read the code. for humans and AIs, debugging benefits greatly from dynamic feedback loops!

i've been doing this workflow since 2022 with gpt-3, but it seems like the latest reasoning models have gotten really good. which makes sense - testing observed logs against explanations seems like the kind of task that reasoning would naturally help with.

once models can drive interactive debuggers or even explore recorded execution traces via tools, this workflow is gonna get way nicer. (lmk if you know of tools that do this already?)

Man,this is such a beautiful video series. Seeing the amount of work it takes to build a good house is awe-inspiring. https://youtu.be/d7e6IOdKfVw?si=...

2025-04-23 09:37:24

Man,this is such a beautiful video series.

Seeing the amount of work it takes to build a good house is awe-inspiring.

https://youtu.be/d7e6IOdKfVw?si=QlQmPnZ7aj-JSVGa