howrar@lemmy.caMEnglish · 4 days agoFactorio Learning Environmentplus-squarejackhopkins.github.ioexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkFactorio Learning Environmentplus-squarejackhopkins.github.iohowrar@lemmy.caMEnglish · 4 days agomessage-square0fedilink
howrar@lemmy.caMEnglish · 11 days agoAndrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning.plus-squarewww.acm.orgexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkAndrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning.plus-squarewww.acm.orghowrar@lemmy.caMEnglish · 11 days agomessage-square0fedilink
howrar@lemmy.caMEnglish · 1 month agoOpen Sourcing π₀plus-squarewww.physicalintelligence.companyexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkOpen Sourcing π₀plus-squarewww.physicalintelligence.companyhowrar@lemmy.caMEnglish · 1 month agomessage-square0fedilink
howrar@lemmy.caMEnglish · 1 month agoA Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambertplus-squarerlhfbook.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkA Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambertplus-squarerlhfbook.comhowrar@lemmy.caMEnglish · 1 month agomessage-square0fedilink
howrar@lemmy.caMEnglish · 3 months agoReinforcement Learning: An Overviewplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkReinforcement Learning: An Overviewplus-squarearxiv.orghowrar@lemmy.caMEnglish · 3 months agomessage-square0fedilink
howrar@lemmy.caMEnglish · 5 months agoKeynotes from the 2024 Reinforcement Learning Conferenceplus-squarewww.youtube.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkKeynotes from the 2024 Reinforcement Learning Conferenceplus-squarewww.youtube.comhowrar@lemmy.caMEnglish · 5 months agomessage-square0fedilink
howrar@lemmy.caMEnglish · edit-26 months agoOpenAI: Learning to Reason with LLMsplus-squareopenai.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkOpenAI: Learning to Reason with LLMsplus-squareopenai.comhowrar@lemmy.caMEnglish · edit-26 months agomessage-square0fedilink
howrar@lemmy.caMEnglish · 1 year agoIntroducing SIMA, a Scalable Instructable Multiworld Agentplus-squaredeepmind.googleexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkIntroducing SIMA, a Scalable Instructable Multiworld Agentplus-squaredeepmind.googlehowrar@lemmy.caMEnglish · 1 year agomessage-square0fedilink