• PattyMcB@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 days ago

    I’m a software developer and I know that AI is just the shiny new toy from which everyone uses the buzzword to generate investment revenue.

    99% of the crap people use it for us worthless. It’s just a hammer and everything is a nail.

    It’s just like “the cloud” was 10 years ago. Now everyone is back-pedaling from that because it didn’t turn out to be the panacea that was promised.

  • brucethemoose@lemmy.world
    link
    fedilink
    English
    arrow-up
    55
    ·
    edit-2
    6 days ago

    It’s ironic how conservative the spending actually is.

    Awesome ML papers and ideas come out every week. Low power training/inference optimizations, fundamental changes in the math like bitnet, new attention mechanisms, cool tools to make models more controllable and steerable and grounded. This is all getting funded, right?

    No.

    Universities and such are seeding and putting out all this research, but the big model trainers holding the purse strings/GPU clusters are not using them. They just keep releasing very similar, mostly bog standard transformers models over and over again, bar a tiny expense for a little experiment here and there. In other words, it’s full corporate: tiny, guaranteed incremental improvements without changing much, and no sharing with each other. It’s hilariously inefficient. And it relies on lies and jawboning from people like Sam Altman.

    Deepseek is what happens when a company is smart but resource constrained. An order of magnitude more efficient, and even their architecture was very conservative.

  • PeteZa@lemm.ee
    link
    fedilink
    English
    arrow-up
    7
    ·
    5 days ago

    I used to support an IVA cluster. Now the only thing I use AI for is voice controls to set timers on my phone.

  • TommySoda@lemmy.world
    link
    fedilink
    English
    arrow-up
    36
    arrow-down
    1
    ·
    edit-2
    6 days ago

    Technology in most cases progresses on a logarithmic scale when innovation isn’t prioritized. We’ve basically reached the plateau of what LLMs can currently do without a breakthrough. They could absorb all the information on the internet and not even come close to what they say it is. These days we’re in the “bells and whistles” phase where they add unnecessary bullshit to make it seem new like adding 5 cameras to a phone or adding touchscreens to cars. Things that make something seem fancy by slapping buzzwords and features nobody needs without needing to actually change anything but bump up the price.

    • Balder@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      ·
      edit-2
      5 days ago

      I remember listening to a podcast that is about scientific explanations. The guy hosting it is very knowledgeable about this subject, does his research and talks to experts when the subject involves something he isn’t himself an expert.

      There was this episode where he kinda got into the topic of how technology only evolves with science (because you need to understand the stuff you’re doing and you need a theory of how it works before you make new assumptions and test those assumptions). He gave an example of the Apple visionPro being a machine that despite being new (the hardware capabilities, at least), the algorithm for tracking eyes they use was developed decades ago and was already well understood and proven correct by other applications.

      So his point in the episode is that real innovation just can’t be rushed by throwing money or more people at a problem. Because real innovation takes real scientists having novel insights and experiments to expand the knowledge we have. Sometimes those insights are completely random, often you need to have a whole career in that field and sometimes it takes a new genius to revolutionize it (think Newton and Einstein).

      Even the current wave of LLMs are simply a product of the Google’s paper that showed we could parallelize language models, leading to the creation of “larger language models”. That was Google doing science. But you can’t control when some new breakthrough is discovered, and LLMs are subject to this constraint.

      In fact, the only practice we know that actually accelerates science is the collaboration of scientists around the world, the publishing of reproducible papers so that others can expand upon and have insights you didn’t even think about, and so on.

      • morrowind@lemmy.ml
        link
        fedilink
        English
        arrow-up
        1
        ·
        6 days ago

        There’s been several smaller breakthroughs since then that arguably would not have happened without so many scientists suddenly turning their attention to the field.

  • vane@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    ·
    edit-2
    6 days ago

    The problem is that those companies are monopolies and can raise prices indefinitely to pursue this shitty dream because they got governments in their pockets. Because gov are cloud / microsoft software dependent - literally every country is on this planet - maybe except China / North Korea and Russia. They can like raise prices 10 times in next 10 years and don’t give a fuck. Spend 1 trillion on AI and say we’re near over and over again and literally nobody can stop them right now.

      • vane@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        5 days ago

        How many governments were using computers back then when IBM was controlling hardware and how many relied on paper and calculators ? The problem is that gov are dependend on companies right now, not companies dependent on governments.

        Imagine Apple, Google, Amazon and Microsoft decides to leave EU on Monday. They say we ban all European citizens from all of our services on Monday and we close all of our offices and delete data from all of our datacenters. Good Fucking Luck !

        What will happen in Europe on Monday ? Compare it with what would happen if IBM said 50 years ago they are leaving Europe.

    • Rin@lemm.ee
      link
      fedilink
      English
      arrow-up
      1
      ·
      5 days ago

      Yes, and maybe finding information right in front of them, and nothing more

  • Ledericas@lemm.ee
    link
    fedilink
    English
    arrow-up
    11
    ·
    6 days ago

    It’s because customers don’t want it or care for it, it’s only the corporations themselves are obsessed with it

  • ABetterTomorrow@lemm.ee
    link
    fedilink
    English
    arrow-up
    8
    ·
    6 days ago

    Current big tech is going to keeping pushing limits and have SM influencers/youtubers market and their consumers picking up the R&D bill. Emotionally I want to say stop innovating but really cut your speed by 75%. We are going to witness an era of optimization and efficiency. Most users just need a Pi 5 16gb, Intel NUC or an Apple air base models. Those are easy 7-10 year computers. No need to rush and get latest and greatest. I’m talking about everything computing in general. One point gaming,more people are waking up realizing they don’t need every new GPU, studios are burnt out, IPs are dying due to no lingering core base to keep franchise up float and consumers can’t keep opening their wallets. Hence studios like square enix going to start support all platforms and not do late stage capitalism with going with their own launcher with a store. It’s over.

  • iAvicenna@lemmy.world
    link
    fedilink
    English
    arrow-up
    6
    ·
    6 days ago

    The funny thing is with so much money you could probably do lots of great stuff with the existing AI as it is. Instead they put all the money into compute power so that they can overfit their LLMs to look like a human.

  • pixxelkick@lemmy.world
    link
    fedilink
    English
    arrow-up
    17
    arrow-down
    10
    ·
    6 days ago

    Meanwhile a huge chunk of the software industry is now heavily using this “dead end” technology 👀

    I work in a pretty massive tech company (think, the type that frequently acquires other smaller ones and absorbs them)

    Everyone I know here is using it. A lot.

    However my company also has tonnes of dedicated sessions and paid time to instruct it’s employees on how to use it well, and to get good value out of it, abd the pitfalls it can have

    So yeah turns out if you teach your employees how to use a tool, they start using it.

    I’d say LLMs have made me about 3x as efficient or so at my job.

    • Snot Flickerman@lemmy.blahaj.zone
      link
      fedilink
      English
      arrow-up
      29
      arrow-down
      3
      ·
      edit-2
      6 days ago

      Your labor before they had LLMs helped pay for the LLMs. If you’re 3x more efficient and not also getting 3x more time off for the labor you put in previously for your bosses to afford the LLMs you got ripped off my dude.

      If you’re working the same amount and not getting more time to cool your heels, maybe, just maybe, your own labor was exploited and used against you. Hyping how much harder you can work just makes you sound like a bitch.

      Real “tread on me harder, daddy!” vibes all throughout this thread. Meanwhile your CEO is buying another yacht.

      • pixxelkick@lemmy.world
        link
        fedilink
        English
        arrow-up
        16
        arrow-down
        2
        ·
        edit-2
        6 days ago

        I am indeed getting more time off for PD

        We delivered on a project 2 weeks ahead of schedule so we were given raises, I got a promotion, and we were given 2 weeks to just do some chill PD at our own discretion as a reward. All paid on the clock.

        Some companies are indeed pretty cool about it.

        I was asked to give some demos and do some chats with folks to spread info on how we had such success, and they were pretty fond of my methodology.

        At its core delivering faster does translate to getting bigger bonuses and kickbacks at my company, so yeah there’s actual financial incentive for me to perform way better.

        You also are ignoring the stress thing. If I can work 3x better, I can also just deliver in almost the same time, but spend all that freed up time instead focusing on quality, polishing the product up, documentation, double checking my work, testing, etc.

        Instead of scraping past the deadline by the skin of our teeth, we hit the deadline with a week or 2 to spare and spent a buncha extra time going over everything with a fine tooth comb twice to make sure we didn’t miss anything.

        And instead of mad rushing 8 hours straight, it’s just generally more casual. I can take it slower and do the same work but just in a less stressed out way. So I’m literally just physically working less hard, I feel happier, and overall my mood is way better, and I have way more energy.

        • Lemminary@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          ·
          6 days ago

          That sounds so cool! I’m glad you’re getting the benefits.

          I’m only wary that the cash-making machine will start tightening the ropes on the free time and the deadlines.

        • gamer@lemm.ee
          link
          fedilink
          English
          arrow-up
          1
          ·
          6 days ago

          Are you a software engineer? Without doxxing yourself, do you think you could share some more info or guidance? I’ve personally been trying to integrate AI code gen into my own work, but haven’t had much success.

          I’ve been able to ask ChatGPT to generate some simple but tedious code that would normally require me read through a bunch of documentation. Usually, that’s a third party library or a part of the standard library I’m not familiar with. My work is mostly Python and C++, and I’ve found that ChatGPT is terrible at C++ and more often than not generates code that doesn’t even compile. It is very good at generating Python by comparison, but unfortunately for me, that’s only like 10% of my work.

          For C++, I’ve found it helpful to ask misc questions about the design of the STL or new language features while I’m studying them myself. It’s not actually generating any code, but it definitely saves me some time. It’s very useful for translating C++'s “standardese” into english, for example. It still struggles generating valid code using C++20 or newer though.

          I also tried a few local models on my GPU, but haven’t had good results. I assume it’s a problem with the models I used not being optimized for code, or maybe the inference tools I tried weren’t using them right (oobabooga, kobold, and some others I don’t remember). If you have any recommendations for good coding models I can run locally on a 4090, I’d love to hear them!

          I tried using a few of those AI code editors (mostly VS Code plugins) years ago, and they really sucked. I’m sure things have improved since then, so maybe that’s the way to go?

          • pixxelkick@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            ·
            5 days ago

            I primarily use GPT style tools like ChatGPT and whatnot.

            The key is, rather than asking it to generate code, specify that you dont want code and instead want it to help you work through the solution. Tell it to ask you meaningful questions about your problem and effectively act as a rubber duck

            Then, after you’ve chosen a solution with it, ask it to generate code based on all the above convo.

            This will typically produce way higher quality results and helps avoid potential X/Y problems.

      • LuigiDidNothingWrong87@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        ·
        6 days ago

        This is how all tech innovation has gone. If you don’t let the bosses exploit your labour someone else will.

        If tech had unions this wouldn’t happen as much, but that’s why they don’t really exist.

    • andallthat@lemmy.world
      link
      fedilink
      English
      arrow-up
      14
      ·
      edit-2
      6 days ago

      It’s not that LLMs aren’t useful as they are. The problem is that they won’t stay as they are today, because they are too expensive. There are two ways for this to go (or an eventual combination of both:

      • Investors believe LLMs are going to get better and they keep pouring money into “AI” companies, allowing them to operate at a loss for longer That’s tied to the promise of an actual “intelligence” emerging out of a statistical model.

      • Investments stop pouring in, the bubble bursts and companies need to make money out of LLMs in their current state. To do that, they need to massively cut costs and monetize. I believe that’s called enshttificarion.

      • pixxelkick@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        5
        ·
        6 days ago

        You skipped possibility 3, which is actively happening ing:

        Advancements in tech enable us to produce results at a much much cheaper cost

        Which us happening with diffusion style LLMs that simultaneously cost less to train, cost less to run, but also produce both faster abd better quality outputs.

        That’s a big part people forget about AI: it’s a feedback loop of improvement as soon as you can start using AI to develop AI

        And we are past that mark now, most developers have easy access to AI as a tool to improve their performance, and AI is made by… software developers

        So you get this loop where as we make better and better AIs, we get better and better at making AIs with the AIs…

        It’s incredibly likely the new diffusion AI systems were built with AI assisting in the process, enabling them to make a whole new tech innovation much faster and easier.

        We are now in the uptick of the singularity, and have been for about a year now.

        Same goes for hardware, it’s very likely now that mvidia has AI incorporating into their production process, using it for micro optimizations in its architectures and designs.

        And then those same optimized gpus turn around and get used to train and run even better AIs…

        In 5-10 years we will look back on 2024 as the start of a very wild ride.

        Remember we are just now in the “computers that take up entire warehouses” step of the tech.

        Remember that in the 80s, a “computer” cost a fortune, took tonnes of resources, multiple people to run it, took up an entire room, was slow as hell, and could only do basic stuff.

        But now 40 years later they fit in our pockets and are (non hyoerbole) billions of times faster.

        I think by 2035 we will be looking at AI as something mass produced for consumers to just go in their homes, you go to best buy and compare different AI boxes to pick which one you are gonna get for your home.

        We are still at the stage of people in the 80s looking at computers and pondering “why would someone even need to use this, why would someone put one in their house, let alone their pocket”

        • andallthat@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          ·
          6 days ago

          I want to believe that commoditization of AI will happen as you describe, with AI made by devs for devs. So far what I see is “developer productivity is now up and 1 dev can do the work of 3? Good, fire 2 devs out of 3. Or you know what? Make it 5 out of 6, because the remaining ones should get used to working 60 hours/week.”

          All that increased dev capacity needs to translate into new useful products. Right now the “new useful product” that all energies are poured into is… AI itself. Or even worse, shoehorning “AI-powered” features in all existing product, whether it makes sense or not (welcome, AI features in MS Notepad!). Once this masturbatory stage is over and the dust settles, I’m pretty confident that something new and useful will remain but for now the level of hype is tremendous!

          • pixxelkick@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            6 days ago

            Good, fire 2 devs out of 3.

            Companies that do this will fail.

            Successful companies respond to this by hiring more developers.

            Consider the taxi cab driver:

            With the invention if the automobile, cab drivers could do their job way faster and way cheaper.

            Did companies fire drivers in response? God no. They hired more

            Why?

            Because they became more affordable, less wealthy clients could now afford their services which means demand went way way up

            If you can do your work for half the cost, usually demand goes up by way more than x2 because as you go down in wealth levels of target demographics, your pool of clients exponentially grows

            If I go from “it costs me 100k to make you a website” to “it costs me 50k to make you a website” my pool of possible clients more than doubles

            Which means… you need to hire more devs asap to start matching this newfound level of demand

            If you fire devs when your demand is about to skyrocket, you fucked up bad lol

      • pixxelkick@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        6 days ago

        For sure, much like how a cab driver has to know how to drive a cab.

        AI is absolutely a “garbage in, garbage out” tool. Just having it doesn’t automatically make you good at your job.

        The difference in someone who can weild it well vs someone who has no idea what they are doing is palpable.

  • nectar45@lemmy.zip
    link
    fedilink
    English
    arrow-up
    9
    arrow-down
    2
    ·
    6 days ago

    Imo our current version of ai are too generalized, we add so much information into the ai to make them good at everything it all mixes together into a single grey halucinating slop that the ai ends up being good at nothing.

    We need to find ways to specialize ai and give said ai a more consistent and concrete personality to move forward.

    • nectar45@lemmy.zip
      link
      fedilink
      English
      arrow-up
      10
      ·
      6 days ago

      Imo to make an ai that is truly good at everything we need to have multiple ai all designed to do something different all working together (like the human brain works) instead of making every single ai a personality-less sludge of jack of all trades master of none

        • pixxelkick@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          6 days ago

          No, it’s just not something exposed to you to see

          But under the hood it very much does shift gears depending on what you ask it to do

          It’s why gpt can do stuff now like analyze contents of images, basic OCR, but also generate images too.

          Yet it can also do math, talk about biology, give relationship advice…

          I believe open AI called the term “specialists” or something vaguely like that, at the time.