Newest Comments

  1.  

    There’s a surfeit of humans, web pages, books, etc capable of being confidently wrong, and sometimes right w/ zero citations. I think the cost of running this AI BS machine will eventually come down to the point where it will fit in a sort of Billy the Bass box and be able to be purchased at spencer’s gifts for us to ask questions of and get fun answers back. But hopefully between now and then a ton of copyright and trademark and other disputes will put an end to training on “everything” only to add more BS to the pile for the next model to train on.

      1.  

        Both ASN.1 and protobuf are tag-length-value encodings with an external schema

        … with the major difference that SEQUENCEs are not delimited with a length. both formats support extensions but ASN.1 requires you to explicitly make space for the extensions in the SEQUENCE while protobuf wants you to declare space in the tag ranges but will accept unknown fields anywhere.

        The complex semantics of X.509 are exacerbated by the schema-mandatory parsing of ASN.1.

        1.  

          A few reasons:

          1. As far as I understand, ChatGPT is different in that it’s trained on more conversation than regular GPT is? But don’t quote me on that.
          2. The chat format is significantly more accessible and easy to understand.
          3. Natural hype cycles, where the “v2” (or what the public perceives to be a v2) of an exciting thing receives a lot more attention than even the original thing.
          1.  

            This looks fantastic. I just added the backlinks Plug, nice! I wish everything (even Back) was a command. I also wish for a cheat sheet on hotkeys.

            1.  

              Perhaps prompt engineering was more important than it first seemed, at least for applying text transformers to practical problems. I don’t think that we have a full explanation of how this particular GPT is prompted yet; but if we do, it would be interesting to try it out on GPT-2 or other publically-available models.

              1.  

                I feel like most of the time when they do what you’re saying, it’s either a side effect of doing something else they consider useful, or a deliberate choice to do something risky because they feel it’s worth it overall.

                Whereas last time I wanted to intentionally expose pgadmin over my VPN instead of just running it locally, I had to jump through hoops to even get it to listen on a non-loopback interface.

                So while it wouldn’t surprise me that someone would do so, I’m not nearly as worried that it’ll be this mass, wormable problem that impacts nearly all installs of pgadmin, as I was when I read the headline initially…

                1.  

                  What would be the best way to store version history? I’d probably want it to git commit everything automatically. Is this something a plugin can do?

                  Or maybe that’s better handled separately. There might be some generic tool that watches a directory for changes and commits automatically.

                  Oh, but maybe sqlite in git is a bad idea? (Edit: you’d probably just gitignore it since it can always be rederived from the .md files.)

                  1.  

                    I think the model isn’t large enough to memorize anything precisely enough to be caught plagiarizing.

                    I’ve played with the trick of convincing it it’s a Linux terminal which can “run” curl https://… and for several URLs it printed HTML that was a plausible content of the site, but the details were entirely made up. For the URL of my personal homepage, it knew my name, it knew I’m a programmer from Poland, but hallucinated HTML markup with a bland structure of the site that is nothing like my actual page.

                    1.  

                      I do love chatGPT but what I guess I’ve not seen yet is people pointing out it “plagiarizes” in the same sense as those art generation bots that caught a similar hype wave a month or two ago.

                      Woah, I haven’t heard of this. Do you mean like the training set is plagiarized, or that it actually outputs plagiarized works?

                      1.  

                        People publish systems to the internet that controls power plants and other types of industrial facilities. Wouldn’t surprise me that someone would expose Pgadmin to the internet.

                        1.  

                          These kinds of bugs are terrifying. You’re one click away in a browser from getting your home directory curled to the malicious third party.

                          It’d be great to have browsers use syscalls like pledge + limit the filesystem access to basically zero so RCE is not as disastrous. Unless something similar is already being done.

                          1.  

                            The scale of this change is fascinating. At 29x faster encoding and 4.5x faster decoding for something done maybe once per HTTP request, the performance difference is imperceptible to the individual user. Still, across humanity, given the ubiquity of curl, the universe is spending appreciably fewer CPU cycles and wall clock time doing base64 operations. I wonder if it could be measured in years per day.

                            1.  

                              Logseq has different file structure, IIRC you can separate directories with dots in wiki links. I prefer Obsidian’s approach, where you specify a path relative to the root of your vault. It uses shortened version if note name is unique.

                              1.  

                                Could someone explain to me why ChatGPT has had hella more explosive popularity than the first release of GPT3?… It’s the same stuff - I just tried it and it’s really no different than GPT3 from day 1. I’m very confused.

                                1.  

                                  The root problem is trying to design a single instruction set for different market segments that have very different requirements

                                  Indeed. The worst part is that they managed to go all in on extension hell and fusion

                                  1.  

                                    The knowledge of the 6 year compromise extended for 5 months before the announcement.

                                    1.  

                                      No one is saying it’s impossible, just that it’s as hard or harder than making JS fast & that took Google level expenditure over many years.

                                      1.  

                                        I find it frustrating that people even while understandably avoiding giving a definition of consciousness are also unwilling to give a general sense of what they mean by consciousness, since it’s pretty clearly a multivalent term. Conscious could refer to being awake vs. being asleep, or being a human vs. being a monkey, or being a human adult vs. being a human baby, or being an unreflective human vs. being an introspective human, or having active awareness of a stimuli vs. having a stimuli without active awareness of it, or … In any event, it’s worth clarifying the ballpark of what you’re talking about before getting into a big debate about it.

                                        1.  

                                          not everyone has the luxury to be paid to work on a project like Daniel does. Time is a limited resource. Why spend it on something that is mostly fine?