I’m @froztbyte more or less everywhere that matters

  • 1 Post
  • 199 Comments
Joined 2 years ago
Cake day: July 2nd, 2023

  • previous stubsack guest star Cursor rejoins the show, using a shitty liarsynth to automatically tell users broken behaviour is expected (cw: orange site), followed by people mass-killing their subscriptions

    Earlier today Cursor, the magical AI-powered IDE, started kicking users off when they logged in from multiple machines. Like, you’d be working on your desktop, switch to your laptop, and all of a sudden you’re forcibly logged out. No warning, no notification, just gone.

    Naturally, people thought this was a new policy.

    So they asked support.

    And here’s where it gets batshit: Cursor has a support email, so users emailed them to find out. The support person told everyone this was “expected behavior” under their new login policy.

    One problem. There was no support team; it was an AI designed to ‘mimic human responses’.

    haven’t gotten into the replies to look for sneers yet but I bet there will be some

  • lol, yeah

    “perverse incentives rule everything around me” is a big thing (observable) in “startup”[0] world because everything[1] is about speed/iteration. for example: why bother spending a few weeks working out a way to generate better training data for a niche kind of puzzle test if you can just code in “personality” and make the autoplag casinobot go “hah, I saw a puzzle almost like this just last week, let’s see if the same solution works…”

    i.e. when faced with a choice of hard vs quick, cynically I’ll guess the latter in almost all cases. there are occasional exceptions, but none of the promptfondlers and modelfarmers are in that set imo

    [0] - look, we may wish to argue about what having billions in vc funding categorizes a business as. but apparently “immature shitderpery” is still squarely “startup”

    [1] - in the bayfucker playbook. I disagree.


  • (excuse possible incoherence it’s 01:20 and I’m entirely in filmbrain (I’ll revise/edit/answer questions in morning))

    re (1): while that is a possibility, keep in mind that all this shit also operates/exists in a metrics-as-targets obsessed space. they might not present end user with hit% but the number exists, and I have no reason to believe that isn’t being tracked. combine that with social effects (public humiliation of their Shiny New Model, monitoring usage in public, etc etc) - that’s where my thesis of directed prompt-improvement is grounded

    re (2): while they could do something like that (synthetic derivation, etc), I dunno if that’d be happening for this. this is outright a guess on my part, a reach based on character, based on what I’ve seen from some of the field, but just……I don’t think they’d try that hard. I think they might try some limited form of it, but only so much as can be backed up in relatively little time and thought. “only as far as you can stretch 3 sprints” type long

    (the other big input in my guesstimation re (2) is an awareness of the fucked interplay of incentives and glorycoders and startup culture)