27 Mar 23 🧵 1/8 RLHF (Reinforcement Learning from Human Feedback) is an approach to fine-tune large-scale language models l… https://t.co/WGOYKlOvpu
14 Jan 13 “@_lexiemichelee: @MariaCabayubi you know how slow I am Maria. But it looks really good” hah I know this. But thank youuu ☺