Code used in the analysis is here

  • LWD@lemm.ee
    4 months ago

    Extremely intended! They built a model to lie and a surrogate model to say the first model was being truthful.

    They called it LaundryML.