Nobody releases training data. It’s too large and varied.
That’s why it’s not Open Source. They do not release the source, and it’s impossible to build the model from source.
I’m here to stay.
Can you actually explain what in my reply is “Fear, uncertainty, and doubt”? Did you actually read it? I even linked to the specific GitHub repository, which is basically empty. You just linked to an overview, which does not point to any source code.
Please explain what’s FUD and link to the source code; otherwise do not accuse people of FUD if you don’t know what you are talking about.
Define “open sourced model”.
The term itself is actually shockingly simple. Source is the original material that was used to build this model: the training data and all files that are needed to compile and create the model. It’s Open Source if these files are available (preferably with an Open Source compatible license). They are not. We only get binary data, the end result, and some intermediate files to fine-tune it.
None of the code and training data is available. It’s just the usual Hugging Face thing, where some weights and parameters are available, nothing else. People keep repeating that DeepSeek (and many other) AI LLM models are open source, but they aren’t.
They even have a GitHub source code repository at https://github.com/deepseek-ai/DeepSeek-R1 , but it only contains an image, a PDF file, and links to download the model on Hugging Face (plus optional weights and parameter files to fine-tune it). There is no source code and no training data available. Also, here is an interesting article talking about this issue: Liesenfeld, Andreas, and Mark Dingemanse. “Rethinking open source generative AI: open washing and the EU AI Act.” The 2024 ACM Conference on Fairness, Accountability, and Transparency. 2024
Well, it’s still not Open Source.
What’s the point of using Rust if you don’t want to think and program in Rust? If you seriously don’t want to learn and deal with safe code and think through every step in advance before compilation, then Rust is the wrong language for you. Either use a low-level language like C or Zig, which gives you control over the system but does not have a borrow checker, or use a language whose runtime handles this automatically for you (garbage collection) without a borrow checker, like Go for example.
For anyone who wonders what this is, GNU Screen is a terminal multiplexer similar to tmux.
Great! I wanted to use Matrix multiple times, but most communities are not there. Good to see Ubuntu switching, it’s a huge community.
We are all idiots sometimes. :-)
Thunderbird didn’t have any built in.
We got Thunderbird notifications before GTA 6.
Finding good project names is one of the hardest things. It depends on the project; sometimes it’s just in front of you, as a play on words that has some meaning. It could be some historical reason why you name the project, or something out of the blue with metaphorical meaning.
I usually think of a few words and then look them up in a thesaurus for words with similar meaning, for example for the random word “management”: https://www.wordhippo.com/what-is/another-word-for/management.html It’s often fruitless, sometimes not. So it’s worth trying if you get stuck.
Basically a distribution that is not a rolling release. It’s hard to recommend a specific distribution. You could use one of the Ubuntus, a Fedora Atomic variant, or Mint; they should be able to run for weeks without issues, unless you update a system component that requires a restart to take effect. Why not openSUSE Leap?
I’m personally on EndeavourOS, a rolling release, and I update often (even the kernel). My PC is also on 24 hours a day, usually for days, sometimes even a week. One trick to avoid some of the restarts is to just log the user out and back in. This should be no problem for you, and at least some of the components start fresh after login.
Then instead of complaining, the person can link it to other platforms too. I don’t get what the problem is here.
What’s Linux specific here? It’s about FFmpeg options, showing examples of how to use them and explaining them. If you understand the examples, then you can use them on any platform.
Good, but it’s too late. I don’t care about those games anymore. At least I might check future games out. This should have been done on day one.
100% - first gen Deck
I am not a huge Deck user, meaning most of my PC time and games are played on my actual PC. The Deck is only a secondary device, for a few times here and there. I have over 130 hours on RetroArch, and a few hours on various other stuff. Therefore it’s expected that the battery is still in good condition.
Thanks for posting. I find the echo part and the extra use of a variable a little bit flaky. Here is a modified version. But I am not 100% sure if it’s doing what your script is doing.
I skipped the extra variable, the echo and the grep by comparing the arguments directly with ${*}, which is similar to ${@}, but won’t separate each argument and creates a single string instead. The =~ /$ is a regex comparison, which Bash supports natively; here it checks whether that string ends with a slash. Then I am using ${@} for the call, which keeps each argument separate. Maybe this could be done with ${*} instead. I’m not sure which of them is the correct one for this case. At least it seems filenames with spaces work. Otherwise, I’m not claiming it would be better. Just giving some food for thought.
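#!/usr/bin/bash
# If the last argument ends with a slash, append {} directly to it.
if [[ "${*}" =~ /$ ]]; then
    xargs -rd '\n' -I {} "${@}"{}
else
    # Otherwise pass {} as a separate argument.
    xargs -rd '\n' -I {} "${@}" {}
fi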
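To make the ${*} versus ${@} difference a bit more concrete, here is a small sketch. The count_args helper and the example arguments are made up for illustration and are not part of the script above; it only shows how Bash expands the two forms.

```bash
#!/usr/bin/bash
# Hypothetical helper: prints how many arguments it received.
count_args() {
    echo "$#"
}

# Simulate a script that was called with two arguments,
# one of them containing a space and ending with a slash.
set -- echo "my file/"

count_args "${@}"   # prints 2: every argument stays separate
count_args "${*}"   # prints 1: all arguments joined into one string

# The regex test sees the joined string "echo my file/",
# so it effectively checks the end of the last argument.
if [[ "${*}" =~ /$ ]]; then
    echo "ends with slash"
fi
```

So a quoted ${*} would hand xargs the whole command as one single word, which is why ${@} looks like the safer choice for the actual call.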
Well, it’s all there in the link, and the screenshot shows it. The option is “Allow GitHub to use my data for AI model training”. At least I think this includes Copilot training, as far as I understand. The Privacy Statement link below specifically talks about Copilot:
Allow GitHub, its affiliates and third parties to use my data,
including Prompts, Suggestions, and Code Snippets, for AI model training.
What are you talking about? What bad faith are you accusing me of? I asked you to show me the repository that contains the source code. There is none. Please give me a link to the repo you have in mind. Where is the source code and training data of DeepSeek-R1? Can we build the model from source?