Weaver introduces a new family of specialised large language models tailored for creative and professional writing. The models range from 1.8B to 34B parameters and are said to outperform larger generalist models like GPT-4 by focusing on human-like text production and diverse content creation capabilities.

    • FaceDeer@kbin.social
      9 months ago

      One of the size classes they mention in the abstract is called “Weaver Pro” so my initial assumption would be that it’s not. However, I find that with this sort of thing the most important secret is that something is possible. If Weaver works as advertised, we will now know that it’s possible for a 34B model to get better-than-GPT4 performance, which means lots of people will be willing to devote resources to recreating it, since they now know those resources won’t be wasted.

      And if Weaver is meant to be “commercial” I wouldn’t be surprised if there’s a bunch of censorship baked into it, so the eventual open-source version will have an advantage.