• daniyeg [he/him]@hexbear.net
    link
    fedilink
    English
    arrow-up
    5
    ·
    22 days ago

    yeah absolutely especially considering agentic use cases it’s limited but i still think it’s a valid metric that is at least correlated with performance for purely chatbot users.