Basically a deer with a human face. Despite probably being some sort of magical nature spirit, his interests are primarily in technology and politics and science fiction.

Spent many years on Reddit before joining the Threadiverse as well.

  • 0 Posts
  • 1.08K Comments
Joined 2 years ago
cake
Cake day: March 3rd, 2024

help-circle












  • The world is changing. It happens from time to time. In this case the change is a particularly big one and it’s still ongoing, so I can’t make any predictions about where it’s going to end. But I can be pretty confident that it’s not going to magically change back. So my best advice is to try out the new tools, see whether you can adapt to them and use them to improve your own productivity in new ways, and if not then as a fallback start looking at other directions to take your career.

    Harsh, perhaps, but the world does as the world does.







  • Social media is typically designed to create and strengthen social bubbles, bringing together like-minded people and showing them what they want to see. It is also designed to feed “engagement”. Rage is a great way to do that.

    Just look at the prevalence of upvoting and downvoting tools in various social media sites. A great way to ensure that opinions that are popular within a particular community become even more prominent, while driving out anything that isn’t popular within that community. Little wonder that views inside those bubbles become a bit skewed compared to the outside world as a whole.


  • You can predict how much a task will take in tokens. The accuracy of the prediction may not be perfect, but if you can ballpark it that can tell you a lot about what models to make use of.

    Also, not all tokens are the same. Different models require different amounts and kinds of computing power to run. Using a very large context costs more per token because you need a computer with a lot of memory to fit it all. If you need it fast that’s more expensive than if you an take your time. Does the task involve vision or audio? Does the context need to be saved for an ongoing chat? Does it need to wait for tool calls to return between rounds? There are a lot of variables that can be tweaked to vary the cost that an AI call will take, and a lot of those variables can be predicted without having to actually run the whole thing first.

    The “cranking up” part has not even started yet, and we already have stories like Uber which blew through their complete AI budget for the year,

    This is exactly what I’m talking about. Current LLM usage patterns tend to be pretty inefficient because people just thow tasks at the biggest and bestest models. Those models handle them, sure, because they’re the biggest and bestest. But most tasks don’t need that much.

    I’ve used coding agents a fair bit along with the various other AI applications I’ve fiddled with, and often I ask them to do things that are dead simple. Create a function to sort some data and select whatever fits certain criteria. Add type checking to a file. Create a unit test for a function. Stuff like that could easily be done by a small local model, but the coding agent sends it off to Opus or whatever just like every other task. That can change.

    There still was no guarantee that the output was useable (and there can’t be such a guarantee, since hallucinations are a statistical fact, increasing in occurrence with smaller amounts of training Data available).

    I don’t think you’ve used modern coding AIs much.

    Or, for that matter, worked with human coders.

    Remember, this is the “killer” application for LLMs.

    There is no one single “killer” application for LLMs. They’re about as general a computing platform as you can get.