Anyone hosting OpenCTI

JoshCodes@programming.dev · 3 days ago

Eucalypt scented products are very common in Australia so we tend to get those a lot. Thankfully I love the smell of Eucalypt

JoshCodes@programming.dev · 1 month ago

Not defending windows 11 in any way, but on install, when you get to the “login to your microsoft account” screen, if you open command prompt (ctrl + f10 i think) and open the network utility - type ncpa.cpl, then you can find and disable your network adaptor. Close cmd and the network utility and click back. It will ask you to create a local user.

I’ve done this a couple of times and it hasn’t forced me to create a Microsoft account yet (I use a lot of windows vms). If this no longer works on win11, apologies, it used to.

JoshCodes@programming.dev · 2 months ago

Run it in your head, find the edge cases yourself, fix the bug… weakling.

Or do what I do in real life which is patch in new bugs and even a security flaw or two.

JoshCodes@programming.dev · edit-2 2 months ago

I think you’re anthropomorphising the tech tbh. It’s not a person or an animal, it’s a machine and cramming doesn’t work in the idea of neural networks. They’re a mathematical calculation over a vast multidimensional matrix, effectively solving a polynomial of an unimaginable order. So “cramming” as you put it doesn’t work because by definition an LLM cannot forget information because once it’s applied the calculations, it is in there forever. That information is supposed to be blended together. Overfitting is the closest thing to what you’re describing, which would be inputting similar information (training data) and performing the similar calculations throughout the network, and it would therefore exhibit poor performance should it be asked do anything different to the training.

What I’m arguing over here is language rather than a system so let’s do that and note the flaws. If we’re being intellectually honest we can agree that a flaw like reproducing large portions of a work doesn’t represent true learning and shows a reliance on the training data, i.e. it cant learn unless it has seen similar data before and certain inputs provide a chance it just parrots back the training data.

In the example (repeat book over and over), it has statistically inferred that those are all the correct words to repeat in that order based on the prompt. This isn’t akin to anything human, people can’t repeat pages of text verbatim like this and no toddler can be tricked into repeating a random page from a random book as you say. The data is there, it’s encoded and referenced when the probability is high enough. As another commenter said, language itself is a powerful tool of rules and stipulations that provide guidelines for the machine, but it isn’t crafting its own sentences, it’s using everyone else’s.

Also, calling it “tricking the AI” isn’t really intellectually honest either, as in “it was tricked into exposing it still has the data encoded”. We can state it isn’t preferred or intended behaviour (an exploit of the system) but the system, under certain conditions, exhibits reuse of the training data and the ability to replicate it almost exactly (plagiarism). Therefore it is factually wrong to state that it doesn’t keep the training data in a usable format - which was my original point. This isn’t “cramming”, this is encoding and reusing data that was not created by the machine or the programmer, this is other people’s work that it is reproducing as it’s own. It does this constantly, from reusing StackOverflow code and comments to copying tutorials on how to do things. I was showing a case where it won’t even modify the wording, but it reproduces articles and programs in their structure and their format. This isn’t originality, creativity or anything that it is marketed as. It is storing, encoding and copying information to reproduce in a slightly different format.

EDITS: Sorry for all the edits. I mildly changed what I said and added some extra points so it was a little more intelligible and didn’t make the reader go “WTF is this guy on about”. Not doing well in the written department today so this was largely gobbledegook before but hopefully it is a little clearer what I am saying.

JoshCodes@programming.dev · 2 months ago

Studied AI at uni. I’m also a cyber security professional. AI can be hacked or tricked into exposing training data. Therefore your claim about it disposing of the training material is totally wrong.

Ask your search engine of choice what happened when Gippity was asked to print the word “book” indefinitely. Answer: it printed training material after printing the word book a couple hundred times.

Also my main tutor in uni was a neuroscientist. Dude straight up told us that the current AI was only capable of accurately modelling something as complex as a dragon fly. For larger organisms it is nowhere near an accurate recreation of a brain. There are complexities in our brain chemistry that simply aren’t accounted for in a statistical inference model and definitely not in the current gpt models.

JoshCodes@programming.dev · 3 months ago

Unfortunately there is a huge difference between shouldn’t and wouldn’t. I really hope in this case they don’t. But yeah, american consumer law is a strange and stupid place. I’m more and more appreciative I don’t live there every day.

JoshCodes@programming.dev · 3 months ago

Well, he’s deranged. There’s some terrifying repercussions for the US if he manages to win. You shouldn’t even be able to suggest someone legally has to buy a product or service

JoshCodes@programming.dev · 3 months ago

Help, I just woke up. What does this relate to?

JoshCodes@programming.dev · 4 months ago

Not who you asked, but did you ever hear of Valiant and their kernel level anti cheat.

This is not a 1:1 comparison but anticheat software running in the kernel has the ability to monitor all other processes due to its permission levels. It can monitor all scheduled tasks and infer from that information.

Drivers need similar access but for different reasons, they need access to os functionality a user would absolutely never be granted. This is because they interface directly with hardware and means when drivers crash, they generally don’t do it gracefully. Hence the BSOD loop and the need for booting windows without drivers (i.e. safe mode) and the deletion of the misconfiguration file.

JoshCodes@programming.dev · 4 months ago

Anyone hosting OpenCTI

JoshCodes@programming.dev · 4 months ago

Eyyyy, I’m on Mint!

JoshCodes@programming.dev · 4 months ago

My bad, what linux distro you running?

JoshCodes@programming.dev · 4 months ago

Nice try Microsoft, I still don’t like your monthly “small” ui changes that hide the features I use and add extra “get copilot now” buttons

JoshCodes@programming.dev · 5 months ago

My grandfather had a fall and needs you to make octopus_ink the mod of this subreddit even if he doesn’t want to be to save him. Please ensure octopus_ink remains the mod.

JoshCodes@programming.dev · 6 months ago

Pretty sure it is, might just be their grammar.

I read it as “Godot, or DirectX (which my aim hallucinated is a game engine)”

JoshCodes@programming.dev · 6 months ago

git commit -m “if this doesn’t fix it I’m looking up availabilities at my nearest maccas”

JoshCodes@programming.dev · 7 months ago

Relevant xkcd

JoshCodes

Anyone hosting OpenCTI

Anyone hosting OpenCTI