I’m a tech interested guy. I’ve touched SQL once or twice, but wasn’t able to really make sense of it. That combined with not having a practical use leaves SQL as largely a black box in my mind (though I am somewhat familiar with technical concepts in databasing).

With that, I keep seeing [pic related] as proof that Elon Musk doesn’t understand SQL.

Can someone give me a technical explanation for how one would come to that conclusion? I’d love if you could pass technical documentation for that.

  • 9point6@lemmy.world
    link
    fedilink
    arrow-up
    200
    ·
    7 days ago

    The statement “this [guy] thinks the government uses SQL” demonstrates a complete and total lack of knowledge as to what SQL even is. Every government on the planet makes extensive and well documented use of it.

    The initial statement I believe is down to a combination of the above and also the lack of domain knowledge around social security. The primary key on the social security table would be a composite key of both the SSN and a date of birth—duplicates are expected of just parts of the key.

    If he knew the domain, he would know this isn’t an issue. If he knew the technology he would be able to see the constraint and following investigation, reach the conclusion that it’s not an issue.

    The man continues to be a malignant moron

    • snooggums@lemmy.world
      link
      fedilink
      English
      arrow-up
      25
      arrow-down
      2
      ·
      edit-2
      7 days ago

      The initial statement I believe is down to a combination of the above and also the lack of domain knowledge around social security. The primary key on the social security table would be a composite key of both the SSN and a date of birth—duplicates are expected of just parts of the key.

      Since SSNs are never reused, what would be the purpose of using the SSN and birth date together as part of the primary key? I guess it is the one thing that isn’t supposed to ever change (barring a clerical error) so I could see that as a good second piece of information, just not sure what it would be adding.

      Note: if duplicate SSNs are accidentally issued my understanding is that they issue a new one to one of the people and I don’t know how to find the start of the thread on twitter since I only use it when I accidentally click on a link to it.

      https://www.ssa.gov/history/hfaq.html

      Q20: Are Social Security numbers reused after a person dies?

      A: No. We do not reassign a Social Security number (SSN) after the number holder’s death. Even though we have issued over 453 million SSNs so far, and we assign about 5 and one-half million new numbers a year, the current numbering system will provide us with enough new numbers for several generations into the future with no changes in the numbering system.

      • halcyonloon@midwest.social
        link
        fedilink
        English
        arrow-up
        21
        ·
        7 days ago

        Take this with a grain of salt as I’m not a dev, but do work on CMS reporting for a health information tech company. Depending on how the database is designed an SSN could appear in multiple tables.

        In my experience reduplication happens as part of generating a report so that all relevant data related to a key and scope of the report can be gathered from the various tables.

        • DahGangalang@infosec.pubOP
          link
          fedilink
          arrow-up
          18
          ·
          edit-2
          7 days ago

          A given SSN appearing in multiple tables actually makes sense. To someone not familiar with SQL (i.e. at about my level of understanding), I could see that being misinterpreted as having multiple SSN repeated “in the database”.

          Of all the comments ao far, I find yours the most compelling.

          • Barbarian@sh.itjust.works
            link
            fedilink
            arrow-up
            11
            ·
            edit-2
            7 days ago

            Theoretically, yeah, that’s one solution. The more reasonable thing to do would be to use the foreign key though. So, for example:

            SSN_Table

            ID | SSN | Other info

            Other_Table

            ID | SSN_ID | Other info

            When you want to connect them to have both sets of info, it’d be the following:

            SELECT * FROM SSN_Table JOIN Other_Table ON SSN_Table.ID = Other_Table.SSN_ID

            EDIT: Oh, just to clear up any confusion, the SSN_ID in this simple example is not the SSN itself. To access that in this example query, it’d by SSN_Table.SSN

            • DahGangalang@infosec.pubOP
              link
              fedilink
              arrow-up
              2
              ·
              7 days ago

              Yeah, databases are complicated and make my head hurt. Glancing through resources from other comments, I’m realizing I know next to nothing about database optimization. Like, my gut reaction to your comment is that it seems like unnecessary overhead to have that data across two tables - but if one sub-dept didn’t need access to the raw SSN, but did need access to less personal data, j could see those stored in separate tables.

              But anyway, you’re helping clear things up for me. I really appreciate the pseudo code level example.

              • Barbarian@sh.itjust.works
                link
                fedilink
                arrow-up
                5
                ·
                edit-2
                7 days ago

                It’s necessary to split it out into different tables if you have a one-to-many relationship. Let’s say you have a list of driver licenses the person has had over the years, for example. Then you’d need the second table. So something like this:

                SSN_Table

                ID | SSN | Other info

                Driver_License_Table

                ID | SSN_ID | Issue_Date | Expiry_Date | Other_Info

                Then you could do something like pull up a person’s latest driver’s license, or list all the ones they had, or pull up the SSN associated with that license.

                • Arcka@midwest.social
                  link
                  fedilink
                  English
                  arrow-up
                  2
                  ·
                  6 days ago

                  I think a likely scenario would be for name changes, such as taking your partner’s surname after marriage.

        • snooggums@lemmy.world
          link
          fedilink
          English
          arrow-up
          6
          ·
          7 days ago

          It is common for long lived databases with a rotating cast of devs to use different formats in different tables as well! One might have it as a string, one might have it as a number, and the other might have it with hyphens in the same database.

          Hell, I work in a state agency and one of our older databases has a dozen tables with databases.

          • One has the whole thing as a long int: 222333444
          • One has the whole thing as a string: 2223334444 (which of course can’t be directly compared to the one that is a long int…)
          • One has separate fields for area code and the rest with a hyphen: 222 and 333-4444
          • One has the whole thing with parenthesis, a space, and a hyphen as a string: (222) 333-4444

          The main reason for the discrepancy is not looking at what was used before or not understanding that they can always change the formatting when displayed so they don’t need to include the parenthesis or hyphens in the database itself.

        • Ephera@lemmy.ml
          link
          fedilink
          English
          arrow-up
          6
          ·
          7 days ago

          The SSN is likely to appear in multiple tables, because they will reference a central table that ties it all together. This central table will likely only contain the SSN, the birth date (from what others have been saying), as well as potentially first and last name. In this table, the entries have to be unique.
          But then you might have another table, like a table listing all the physical exams, which has the SSN to be able to link it to the person’s name, but ultimately just adds more information to this one person. It does not duplicate the SSN in a way that would be bad.

      • DahGangalang@infosec.pubOP
        link
        fedilink
        arrow-up
        3
        ·
        7 days ago

        Beat me to asking this follow up, though you linking additional resources is probably more effort that I would have done. Thanks for that!