• KeenFlame@feddit.nu
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    1
    ·
    edit-2
    8 months ago

    Absolutely, it’s one of the first curious things you discover when using them, such as stable diffusion “masterpiece” or the famous system prompt leaks from proprietary llms

    It makes sense in how it works but in proprietary use it is mostly handled for you

    Finding the right words and amount is a hilarious exercise that provides pretty good insight in the attention mechanics

    Consider the “let’s work step by step”

    This proved a revolutionary way to system the coders as they then will structure the output better, there’s then more research that happened around why this is so amazingly effective at making the model proof check itself

    Predictions are obviously closely related to the action part of our brains as well, so it makes sense that it would help when you think about it