A Text-based Safety Benchmark for Reinforcement Learning ProblemsNgoc Lan HoangNicolas Galichetet al.2022NeurIPS 2022
STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language ModelsShreyas BasavatiaKeerthiram Murugesanet al.2024ACL 2024
ComplexWorld: A Large Language Model-based Interactive Fiction Learning Environment for Text-based Reinforcement Learning AgentsShreyas BasavatiaShivam Ratnakaret al.2023IJCAI 2023
Graphical modeling for dynamic safety hints generalisation for Safe Deep Reinforcement Learning AgentsLamogha ChiazorNgoc Lan Hoanget al.2023IJCAI 2023