99707145
Mar 17, 2026
Providing temporary use of online, non-downloadable computer software for managing the full reinforcement learning (RL) lifecycle, namely, building, training, scaling, monitoring, and deploying AI agents; Providing temporary use of online, non-downloadable computer software for building, training, scaling, monitoring, and deploying reinforcement learning (RL) workloads and applications; Providing temporary use of online, non-downloadable computer software featuring a reinforcement learning operations (RLOs) platform; Providing temporary use of online, non-downloadable computer software for automatic tuning of any single- and multi-agent task; Providing temporary use of online, non-downloadable computer software for fine-tuning large language models (LLMs) using hyperparameter optimization (HPO) and one-click deployment; Providing temporary use of online, non-downloadable computer software for validating datasets and environments before training AI agents; Providing temporary use of online, non-downloadable computer software for selecting algorithms, rewards, constraints, and objectives to configure AI agents; Providing temporary use of online, non-downloadable computer software for distributing AI agents across multiple graphic processing units (GPUs) and for monitoring metrics, sampling efficiency, and providing checkpoints in real time; Providing temporary use of online, non-downloadable computer software for promoting a model from a testing environment to a live production environment with one click; Providing temporary use of online, non-downloadable computer software for monitoring and tracking AI agent performance, executing instant roll backs, and iterating through training credits
Computer and Scientific