9 comments

  • LatencyKills 15 hours ago
    Ex-Apple engineer here. I really like your implementation. A few years ago I built a similar tool to help me automate the testing of some of my native macOS apps. Being able to run multiple UI automation tests simultaneously was the big win in my case.

    My only criticism is enabling telemetry by default. I'm a fan of having people opt-in.

    • jorvi 10 hours ago
      The problem with opt-in telemetry is that 95% of users don't change defaults, and the 5% who do are your power users. They're not representative of the average user. And only a subset of them will turn it on

      Ironically enough the opposite happens with opt-out telemetry, for the same reason: a lot of power users will turn off telemetry, thus you will never see their usage patterns and will have to infer them. Dogfooding helps.

      • crazygringo 10 hours ago
        I'm confused.

        You claim power users opt in to telemetry, and then immediately say power users opt out.

        • jorvi 8 hours ago
          A subset of power users want to their usage to be profiled (me, if I trust the company. Brave, Mozilla, Mullvad, 1Password, Bitwarden, Valve, companies like that). But most power users will not want that because of privacy worries.

          From that you get two situations.

          Opt-in:

          - Regular users: click all 'ok' through setup at lightning speed, no telemetry enabled.

          - Most power users: consciously don't check the box to opt-in because of privacy worries.

          - Big picture power users: consciously check the opt-in box given they trust you (because they want their usage patterns to be profiled and optimized for).

          Opt-out:

          - Regular users: click all 'ok' through setup at lightning speed, telemetry enabled.

          - Most power users: consciously check the box to opt-out because of privacy worries.

          - Big picture power users: consciously don't check the opt-out box given they trust you (because they want their usage patterns to be profiled and optimized for).

        • awwaiid 8 hours ago
          power users opt in to opt in telemetry, and power users opt out of opt out telemetry. Power users click all the buttons.
      • jonhohle 7 hours ago
        If Charmin put sensors in toilet paper rolls to optimize the wiping experience, it would be dystopian. Why do we give software a pass? Privacy is a right not a telemetry problem and opt-out by default is non-consensual surveillance.
      • pnw_throwaway 10 hours ago
        The problem with opt-in telemetry is that 95% of users are sick and tired of being spied on with every little thing they do.
        • eddyg 4 hours ago
          Telemetry (if it’s truly telemetry) is nowhere close to “tracking”. People conflate the two all the time. One can provide useful, anonymous metrics (e.g. “user enabled feature X”) without doing anything but incrementing the counter for “feature X”.

          The “Firefox Problem” is that all the power users disable telemetry, so all the “cool” features that power users like (but never get used by “regular people”) get ignored or removed instead of improved because, according to the metrics, “nobody uses them”.

        • jorvi 8 hours ago
          If they really were they would turn it off. And stop using Gmail and Android.

          The overwhelming majority of people don't care about digital privacy because the cost is opaque to them.

          Also, telemetry when done right isn't "spying". Again, it is anonymized and used to see, for example, where the hot paths and paper cuts in applications are.

        • dewey 4 hours ago
          As you can see with TikTok / Instagram usage…regular people who are not on HN could not care less about that.
    • frabonacci 15 hours ago
      Fair criticism. We took a similar approach to established dev tools like Homebrew, with an anonymous, opt-out telemetry to understand install issues, crashes, and high-level usage. For cua-driver specifically, telemetry is limited to command/tool-level events and basic environment metadata. We don’t send screenshots, recordings, app contents, prompts, typed text, file paths, or tool arguments. That said, we should make the opt-out path clearer
    • kveykva 10 hours ago
      Would you be open to sharing what you built for running the automation tests? I could really use this right now.
      • frabonacci 10 hours ago
        We don't have a specific testing framework yet. cua-driver is closer to an automation interface than a test runner. that said, you could definitely build one on top of it. For reference these are some of our integration tests: https://github.com/trycua/cua/tree/main/libs/cua-driver/Test...

        One useful trick is to cua-driver 'launch_app' instead of the default 'open' or other osascript, since it can start the app without raising/focusing it, and the tests don't disturb your active desktop while they run

  • krackers 10 hours ago
    Nice! Thanks for the technical writeup, ~2 weeks from me wondering how it's implemented [1] to being able to play with a replicated version!

    [1] https://news.ycombinator.com/item?id=47799128

  • dtran 9 hours ago
    This is one of the coolest hacks I've seen recently. Having done some much less involved MacOS hacking, I can't help but wonder if we may finally see momentum behind some flavor of agent-friendly Linux/Android if Apple doesn't give us more ways to let agents interact with our machines.
    • frabonacci 9 hours ago
      really appreciate it. macOS has powerful primitives already, but they weren’t designed as one coherent agent API so you end up stitching together and hitting roadblocks. If Apple doesn't make this more first-class, Linux/Android-style environments may move faster because they’re easier to instrument. I think the OpenAI/Jony Ive AI hardware rumors are yet another signal that people may start building agent-native CUA devices instead of retrofitting agents onto existing desktops
  • pimlottc 8 hours ago
    What is specific about this for using with agents? As opposed to offering it as a general automation library for any use?
  • j-conn 8 hours ago
    Incredible! I’m interested in doing something similar on windows, have you looked into that at all? Apparently codex computer use plans to support this on windows in the future. Were you able to see how codex was doing it, or the inspiration was just “they’ve shown it’s possible”?
    • frabonacci 7 hours ago
      Thanks! We haven't gone deep on Windows yet because we're still focused on polishing the macOS release. We want to go deeper on the Mac experience before going broader across platforms, and there are still a lot of features we want to ship and use cases we want to share.
  • alsetmusic 10 hours ago
    I tried out their Loom vm software a couple of months back. Worked well, fwiw. I'm not using it anymore because I decided to just give agents direct (supervised) access to my devices.
    • prashant3210 48 minutes ago
      Same here. I give agents supervised direct access on my Mac for a side project. Session stealing is annoying. VM feels overkill for solo dev, but hate that the cursor jumps around while I try to do other things. Background driver sounds like the missing middle ground.
    • frabonacci 10 hours ago
      Thanks for trying out Lume! We definitely haven't given up on the idea of sandboxing GUI agents in local macOS VMs. Cua Driver is aimed at a different use case though, letting coding agents and general agents use the Mac you're already on, asynchronously and in the background. That also makes the economics better since multiple agents can share the same machine instead of each needing its own VM
  • BenFranklin100 4 hours ago
    Being new to the idea of using agents to run programs on one’s computer, could someone provide several use cases?
  • davey2wavey 10 hours ago
    Its looking great.

    The audit trail question is interesting and I haven't seen it come up much. When an agent clicks through an ERP or edits a file, you've got logs, but how do you explain the "why" behind each decision to, say, a compliance team?

    Curious if that's something you're thinking about or if it's too early.

  • dmazhukov 5 hours ago
    [dead]