• ☆ Yσɠƚԋσʂ ☆@lemmy.mlOP
    arrow-up
    2
    arrow-down
    9
    ·
    edit-2
    4 days ago

    So you found a legacy data set that was released nearly a year ago as your best example. Thanks for proving my point. And since you obviously know what you’re talking about, do explain to the class what stops people from using these data sets to train a DeepSeek model?

      • ☆ Yσɠƚԋσʂ ☆@lemmy.mlOP
        arrow-up
        2
        arrow-down
        7
        ·
        4 days ago

        What I do know is that you can take the DeepSeek model and train it on this open crawl to get a fully open model. I love how you ignored this part in your reply, being the clown that you are.

        • The Octonaut@mander.xyz
          arrow-up
          10
          arrow-down
          2
          ·
          4 days ago

          I ignored the bit you edited in after I replied? And you’re complaining about ignoring questions in general? Do you disagree with the OSI definition Yogsy? You feel ready for that question yet?

          What on earth do you even mean “take a model and train it on this open crawl to get a fully open model”? This sentence doesn’t even make sense. Never mind that that’s not how training a model works - let’s pretend it is. You understand that adding open source data to closed source data wouldn’t make the closed source data less closed source, right?.. Right?

          Thank fuck you’re not paid real money for this Yiggly because they’d be looking for their dollars back

          • ☆ Yσɠƚԋσʂ ☆@lemmy.mlOP
            arrow-up
            2
            arrow-down
            4
            ·
            4 days ago

            Why would you lie about something with timestamps? I edited 18 min ago, and you replied 17 min ago. 🤡

            Do you disagree with the OSI definition Yogsy? You feel ready for that question yet?

            I already answered this question earlier in the thread, but clearly your reading comprehension needs some work.

            What on earth do you even mean “take a model and train it on this open crawl to get a fully open model”?

            I’m talking about taking the code that DeepSeek released publicly, and training it on the open source data that’s available. That’s what model training is. The fact that this needs to be spelled out for you is amazing.

            You understand that adding open source data to closed source data wouldn’t make the closed source data less closed source, right?.. Right?

            What closed source data are you talking about? Nobody is suggesting this.

            Thank fuck you’re not paid real money for this Yiggly because they’d be looking for their dollars back

            You sound upset there little buddy. I guess misspelling my handle was the peak insult you could muster. Really showing your intellectual prowess there champ.

            • The Octonaut@mander.xyz
              arrow-up
              3
              arrow-down
              2
              ·
              4 days ago

              I take more than a minute on my replies, Autocorrect Disaster. You asked for information and I treat your request as genuine because it just leads to more hilarity, like you describing a model as “code”.

              • ☆ Yσɠƚԋσʂ ☆@lemmy.mlOP
                arrow-up
                2
                ·
                3 days ago

                The only hilarity here is you exposing yourself as being utterly clueless on the subject you’re attempting to debate. A model is a deep neural network that’s generated by code through reinforcement training on the data. Evidently you don’t understand this, which leads you to make absurd statements. I asked you for information because I knew you were a troll and now you’ve confirmed it.

                • The Octonaut@mander.xyz
                  arrow-up
                  2
                  arrow-down
                  1
                  ·
                  3 days ago

                  I understand it completely, insofar as it’s nonsensically irrelevant - the model is what you’re calling open source, and the model is not open source because the data set is not published or recreatable. They can open source any training code they want - I genuinely haven’t even checked - but the model is not open source. Which is my point from about 20 comments ago. Unless you disagree with the OSI’s definition, which is a valid and interesting opinion. If that’s the case you could have just said so. OSI are just a bunch of dudes. They have plenty of critics in the Free/Open communities. Hey, they’re probably American too if you want to throw in some downfall of The West classic hits!

                  If a troll is “not letting you pretend you have a clue what you’re talking about because you managed to get ollama to run a model locally and think it’s neat”, cool. Owning that. You could also just try owning that you think it’s neat. It is. It’s not an open source model though. You can run Meta’s model with the same level of privacy (offline) and with the same level of ability to adapt or recreate it (you can’t, you don’t have the full data set or steps to recreate it).

                  • ☆ Yσɠƚԋσʂ ☆@lemmy.mlOP
                    arrow-up
                    2
                    arrow-down
                    1
                    ·
                    3 days ago

                    I never disagreed that you can run Meta’s model with the same level of privacy, so I don’t know why you keep bringing that up as some sort of gotcha. The point about DeepSeek is its efficiency. The OSI definition for open source is good, and it does look like you’re right that the full data set is not available. However, the real question is why you’d be so hung up on that.

                    Given that the code for training a new model is released, and it can be applied to open data sets, that means it’s perfectly possible to make a version that’s trained on open data that would check off the final requirement you keep bringing up. Also, adapting it does not require having the original training set since it’s done by tuning the weights in the network itself. Go read up on how LoRA works for example.
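
For anyone following along who hasn't looked at LoRA: the rough idea is that the pretrained weight matrix stays frozen and only a small low-rank correction is trained on whatever new data you have. Below is a toy sketch in plain Python, not DeepSeek's code and not any real library's API. The class name `LoRALinear`, the helper `matvec`, the sizes, and the learning rate are all made up for illustration.

```python
import random

def matvec(M, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, x)) for row in M]

class LoRALinear:
    """Toy linear layer: frozen W plus a trainable low-rank update (alpha / r) * B @ A."""

    def __init__(self, W, r, alpha=1.0, seed=0):
        rng = random.Random(seed)
        d_out, d_in = len(W), len(W[0])
        self.W = W  # pretrained weights, never modified below
        # A starts random and B starts at zero, so the update is zero at init.
        self.A = [[rng.gauss(0, 0.5) for _ in range(d_in)] for _ in range(r)]
        self.B = [[0.0] * r for _ in range(d_out)]
        self.scale = alpha / r

    def forward(self, x):
        base = matvec(self.W, x)
        delta = matvec(self.B, matvec(self.A, x))
        return [b + self.scale * d for b, d in zip(base, delta)]

    def sgd_step(self, x, target, lr=0.05):
        """One SGD step on 0.5 * ||y - target||^2. Only A and B change.
        Returns the loss measured before the update."""
        h = matvec(self.A, x)
        y = self.forward(x)
        err = [yi - ti for yi, ti in zip(y, target)]
        # Gradient flowing back into h, computed with the old B.
        g_h = [sum(self.B[i][k] * err[i] for i in range(len(err)))
               for k in range(len(h))]
        for i in range(len(self.B)):
            for k in range(len(h)):
                self.B[i][k] -= lr * self.scale * err[i] * h[k]
        for k in range(len(self.A)):
            for j in range(len(x)):
                self.A[k][j] -= lr * self.scale * g_h[k] * x[j]
        return 0.5 * sum(e * e for e in err)

    def trainable_params(self):
        return sum(len(row) for row in self.A) + sum(len(row) for row in self.B)

# Hypothetical 4x4 "pretrained" layer adapted with a rank-1 update.
W = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
layer = LoRALinear(W, r=1)
x = [1.0, 0.0, -1.0, 0.5]
target = [2.0, 0.0, -1.0, 0.5]  # new behaviour we want on new data
losses = [layer.sgd_step(x, target) for _ in range(20)]
print("trainable:", layer.trainable_params(), "of", len(W) * len(W[0]))
print("first loss:", losses[0], "last loss:", losses[-1])
```

Nothing in the adaptation step touches the original training set: it only needs the frozen weights and the new examples. The saving is tiny in this 4x4 toy but grows with layer size, since the adapter has `r * (d_in + d_out)` parameters against `d_in * d_out` for the full matrix.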