René's URL Explorer Experiment


Title: trinity.algorithm.algorithm module — Trinity-RFT 0.4.0 documentation

direct link

Domain: modelscope.github.io

docsearch:languageen

Links:

Skip to main contenthttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#main-content
https://modelscope.github.io/Trinity-RFT/en/main/index.html
latesthttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html
v0.4.0https://modelscope.github.io/Trinity-RFT/en/v0.4.0/index.html
v0.3.3https://modelscope.github.io/Trinity-RFT/en/v0.3.3/index.html
Installationhttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/trinity_installation.html
Developer Guidehttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/develop_overview.html
Workflow Development Guidehttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/develop_workflow.html
Algorithms Development Guidehttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/develop_algorithm.html
Advanced Algorithm Developmenthttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_mix_algo.html
Operator Development Guidehttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/develop_operator.html
🧪 Experimental: Task Selectionhttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/develop_selector.html
Configuration Guidehttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/trinity_configs.html
GPU Configuration Guidehttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/trinity_gpu_configs.html
Synchronizer in Trinity-RFThttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/synchronizer.html
Align configuration with veRLhttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/align_with_verl.html
Quick Starthttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_reasoning_basic.html
Off-Policy RFThttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_reasoning_advanced.html
Asynchronous RFThttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_async_mode.html
Concatenated Multi-Turn RFThttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_multi_turn.html
General Multi-Step RFThttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_step_wise.html
ReAct Agent Traininghttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_react.html
Email Search Workflowhttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_search_email.html
Offline DPO and SFThttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_dpo.html
Tinker Backendhttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_tinker_backend.html
Megatron-LM Backendhttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_megatron.html
Data Processinghttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_data_functionalities.html
Example Summaryhttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/example_dataset_perspective.html
FAQhttps://modelscope.github.io/Trinity-RFT/en/main/tutorial/faq.html
API Referencehttps://modelscope.github.io/Trinity-RFT/en/main/api_reference.html
trinity.buffer packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.html
trinity.buffer.operators packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.operators.html
trinity.buffer.operators.filters packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.operators.filters.html
trinity.buffer.operators.mappers packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.operators.mappers.html
trinity.buffer.operators.data_juicer_operator modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.operators.data_juicer_operator.html
trinity.buffer.operators.experience_operator modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.operators.experience_operator.html
trinity.buffer.pipelines packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.pipelines.html
trinity.buffer.pipelines.experience_pipeline modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.pipelines.experience_pipeline.html
trinity.buffer.pipelines.task_pipeline modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.pipelines.task_pipeline.html
trinity.buffer.reader packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.reader.html
trinity.buffer.reader.file_reader modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.reader.file_reader.html
trinity.buffer.reader.queue_reader modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.reader.queue_reader.html
trinity.buffer.reader.sql_reader modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.reader.sql_reader.html
trinity.buffer.schema packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.schema.html
trinity.buffer.schema.formatter modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.schema.formatter.html
trinity.buffer.schema.sql_schema modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.schema.sql_schema.html
trinity.buffer.selector packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.selector.html
trinity.buffer.selector.difficulty_estimator modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.selector.difficulty_estimator.html
trinity.buffer.selector.selector modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.selector.selector.html
trinity.buffer.storage packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.storage.html
trinity.buffer.storage.file modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.storage.file.html
trinity.buffer.storage.queue modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.storage.queue.html
trinity.buffer.storage.sql modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.storage.sql.html
trinity.buffer.writer packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.writer.html
trinity.buffer.writer.file_writer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.writer.file_writer.html
trinity.buffer.writer.queue_writer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.writer.queue_writer.html
trinity.buffer.writer.sql_writer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.writer.sql_writer.html
trinity.buffer.buffer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.buffer.html
trinity.buffer.buffer_reader modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.buffer_reader.html
trinity.buffer.buffer_writer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.buffer_writer.html
trinity.buffer.task_scheduler modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.task_scheduler.html
trinity.buffer.utils modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.utils.html
trinity.buffer.viewer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.buffer.viewer.html
trinity.explorer packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.explorer.html
trinity.explorer.proxy packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.explorer.proxy.html
trinity.explorer.proxy.app modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.explorer.proxy.app.html
trinity.explorer.proxy.client modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.explorer.proxy.client.html
trinity.explorer.proxy.recorder modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.explorer.proxy.recorder.html
trinity.explorer.proxy.service modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.explorer.proxy.service.html
trinity.explorer.explorer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.explorer.explorer.html
trinity.explorer.scheduler modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.explorer.scheduler.html
trinity.explorer.workflow_runner modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.explorer.workflow_runner.html
trinity.trainer packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.html
trinity.trainer.tinker packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.tinker.html
trinity.trainer.tinker.utils modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.tinker.utils.html
trinity.trainer.verl packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.verl.html
trinity.trainer.verl.dp_actor modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.verl.dp_actor.html
trinity.trainer.verl.fsdp_checkpoint_manager modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.verl.fsdp_checkpoint_manager.html
trinity.trainer.verl.fsdp_workers modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.verl.fsdp_workers.html
trinity.trainer.verl.megatron_actor modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.verl.megatron_actor.html
trinity.trainer.verl.megatron_checkpoint_manager modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.verl.megatron_checkpoint_manager.html
trinity.trainer.verl.megatron_workers modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.verl.megatron_workers.html
trinity.trainer.verl.utils modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.verl.utils.html
trinity.trainer.tinker_trainer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.tinker_trainer.html
trinity.trainer.trainer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.trainer.html
trinity.trainer.verl_trainer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.trainer.verl_trainer.html
trinity.algorithm packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.html
trinity.algorithm.advantage_fn packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.html
trinity.algorithm.advantage_fn.advantage_fn modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.advantage_fn.html
trinity.algorithm.advantage_fn.asymre_advantage modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.asymre_advantage.html
trinity.algorithm.advantage_fn.grpo_advantage modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.grpo_advantage.html
trinity.algorithm.advantage_fn.multi_step_grpo_advantage modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.multi_step_grpo_advantage.html
trinity.algorithm.advantage_fn.on_policy_distill_advantage modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.on_policy_distill_advantage.html
trinity.algorithm.advantage_fn.opmd_advantage modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.opmd_advantage.html
trinity.algorithm.advantage_fn.ppo_advantage modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.ppo_advantage.html
trinity.algorithm.advantage_fn.rec_advantage modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.rec_advantage.html
trinity.algorithm.advantage_fn.reinforce_advantage modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.reinforce_advantage.html
trinity.algorithm.advantage_fn.reinforce_plus_plus_advantage modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.reinforce_plus_plus_advantage.html
trinity.algorithm.advantage_fn.remax_advantage modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.remax_advantage.html
trinity.algorithm.advantage_fn.rloo_advantage modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.advantage_fn.rloo_advantage.html
trinity.algorithm.entropy_loss_fn packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.entropy_loss_fn.html
trinity.algorithm.entropy_loss_fn.entropy_loss_fn modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.entropy_loss_fn.entropy_loss_fn.html
trinity.algorithm.kl_fn packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.kl_fn.html
trinity.algorithm.kl_fn.kl_fn modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.kl_fn.kl_fn.html
trinity.algorithm.policy_loss_fn packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.html
trinity.algorithm.policy_loss_fn.chord_policy_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.chord_policy_loss.html
trinity.algorithm.policy_loss_fn.cispo_policy_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.cispo_policy_loss.html
trinity.algorithm.policy_loss_fn.dpo_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.dpo_loss.html
trinity.algorithm.policy_loss_fn.gspo_policy_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.gspo_policy_loss.html
trinity.algorithm.policy_loss_fn.importance_sampling_policy_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.importance_sampling_policy_loss.html
trinity.algorithm.policy_loss_fn.mix_policy_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.mix_policy_loss.html
trinity.algorithm.policy_loss_fn.opmd_policy_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.opmd_policy_loss.html
trinity.algorithm.policy_loss_fn.policy_loss_fn modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.policy_loss_fn.html
trinity.algorithm.policy_loss_fn.ppo_policy_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.ppo_policy_loss.html
trinity.algorithm.policy_loss_fn.rec_policy_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.rec_policy_loss.html
trinity.algorithm.policy_loss_fn.sapo_policy_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.sapo_policy_loss.html
trinity.algorithm.policy_loss_fn.sft_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.sft_loss.html
trinity.algorithm.policy_loss_fn.sppo_loss_fn modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.sppo_loss_fn.html
trinity.algorithm.policy_loss_fn.topr_policy_loss modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.policy_loss_fn.topr_policy_loss.html
trinity.algorithm.sample_strategy packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.sample_strategy.html
trinity.algorithm.sample_strategy.mix_sample_strategy modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.sample_strategy.mix_sample_strategy.html
trinity.algorithm.sample_strategy.sample_strategy modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.sample_strategy.sample_strategy.html
trinity.algorithm.sample_strategy.utils modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.sample_strategy.utils.html
trinity.algorithm.algorithm modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html
trinity.algorithm.key_mapper modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.key_mapper.html
trinity.algorithm.utils modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.utils.html
trinity.manager packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.manager.html
trinity.manager.config_registry packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.manager.config_registry.html
trinity.manager.config_registry.algorithm_config_manager modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.manager.config_registry.algorithm_config_manager.html
trinity.manager.config_registry.buffer_config_manager modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.manager.config_registry.buffer_config_manager.html
trinity.manager.config_registry.config_registry modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.manager.config_registry.config_registry.html
trinity.manager.config_registry.explorer_config_manager modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.manager.config_registry.explorer_config_manager.html
trinity.manager.config_registry.model_config_manager modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.manager.config_registry.model_config_manager.html
trinity.manager.config_registry.trainer_config_manager modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.manager.config_registry.trainer_config_manager.html
trinity.manager.config_manager modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.manager.config_manager.html
trinity.manager.state_manager modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.manager.state_manager.html
trinity.manager.synchronizer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.manager.synchronizer.html
trinity.common packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.html
trinity.common.models packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.models.html
trinity.common.models.vllm_patch packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.models.vllm_patch.html
trinity.common.models.mm_utils modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.models.mm_utils.html
trinity.common.models.model modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.models.model.html
trinity.common.models.tinker_model modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.models.tinker_model.html
trinity.common.models.utils modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.models.utils.html
trinity.common.models.vllm_model modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.models.vllm_model.html
trinity.common.models.vllm_worker modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.models.vllm_worker.html
trinity.common.rewards packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.html
trinity.common.rewards.accuracy_reward modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.accuracy_reward.html
trinity.common.rewards.agents_reward modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.agents_reward.html
trinity.common.rewards.countdown_reward modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.countdown_reward.html
trinity.common.rewards.dapo_reward modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.dapo_reward.html
trinity.common.rewards.eval_utils modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.eval_utils.html
trinity.common.rewards.format_reward modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.format_reward.html
trinity.common.rewards.human_reward modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.human_reward.html
trinity.common.rewards.math_reward modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.math_reward.html
trinity.common.rewards.naive_dapo_score modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.naive_dapo_score.html
trinity.common.rewards.qwen25_eval modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.qwen25_eval.html
trinity.common.rewards.reward_fn modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.reward_fn.html
trinity.common.rewards.tool_reward modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.tool_reward.html
trinity.common.rewards.utils modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.rewards.utils.html
trinity.common.workflows packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.html
trinity.common.workflows.agentscope packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.agentscope.html
trinity.common.workflows.agentscope_workflow modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.agentscope_workflow.html
trinity.common.workflows.customized_math_workflows modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.customized_math_workflows.html
trinity.common.workflows.customized_toolcall_workflows modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.customized_toolcall_workflows.html
trinity.common.workflows.eval_workflow modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.eval_workflow.html
trinity.common.workflows.math_rm_workflow modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.math_rm_workflow.html
trinity.common.workflows.math_ruler_workflow modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.math_ruler_workflow.html
trinity.common.workflows.math_trainable_ruler_workflow modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.math_trainable_ruler_workflow.html
trinity.common.workflows.on_policy_distill_workflow modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.on_policy_distill_workflow.html
trinity.common.workflows.rubric_judge_workflow modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.rubric_judge_workflow.html
trinity.common.workflows.simple_mm_workflow modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.simple_mm_workflow.html
trinity.common.workflows.step_wise_workflow modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.step_wise_workflow.html
trinity.common.workflows.workflow modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.workflows.workflow.html
trinity.common.config modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.config.html
trinity.common.constants modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.constants.html
trinity.common.experience modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.experience.html
trinity.common.verl_config modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.verl_config.html
trinity.utils packagehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.utils.html
trinity.utils.annotations modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.utils.annotations.html
trinity.utils.distributed modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.utils.distributed.html
trinity.utils.dlc_utils modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.utils.dlc_utils.html
trinity.utils.log modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.utils.log.html
trinity.utils.lora_utils modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.utils.lora_utils.html
trinity.utils.monitor modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.utils.monitor.html
trinity.utils.plugin_loader modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.utils.plugin_loader.html
trinity.utils.registry modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.utils.registry.html
trinity.utils.timer modulehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.utils.timer.html
https://github.com/modelscope/Trinity-RFT
.rst https://modelscope.github.io/Trinity-RFT/en/main/_sources/build_api/trinity.algorithm.algorithm.rst
ConstantMetahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ConstantMeta
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
AlgorithmType.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.use_critic
AlgorithmType.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.use_reference
AlgorithmType.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.compute_advantage_in_trainer
AlgorithmType.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.can_balance_batch
AlgorithmType.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.schema
AlgorithmType.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.default_config
AlgorithmType.name()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.name
AlgorithmType.check_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.check_config
SFTAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm
SFTAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.use_critic
SFTAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.use_reference
SFTAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.compute_advantage_in_trainer
SFTAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.can_balance_batch
SFTAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.schema
SFTAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.default_config
PPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm
PPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.use_critic
PPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.use_reference
PPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.compute_advantage_in_trainer
PPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.can_balance_batch
PPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.schema
PPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.default_config
GRPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm
GRPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.use_critic
GRPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.use_reference
GRPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.compute_advantage_in_trainer
GRPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.can_balance_batch
GRPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.schema
GRPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.default_config
ReinforcePlusPlusAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm
ReinforcePlusPlusAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.use_critic
ReinforcePlusPlusAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.use_reference
ReinforcePlusPlusAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.compute_advantage_in_trainer
ReinforcePlusPlusAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.can_balance_batch
ReinforcePlusPlusAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.schema
ReinforcePlusPlusAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.default_config
RLOOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm
RLOOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.use_critic
RLOOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.use_reference
RLOOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.compute_advantage_in_trainer
RLOOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.can_balance_batch
RLOOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.schema
RLOOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.default_config
OPMDAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm
OPMDAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.use_critic
OPMDAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.use_reference
OPMDAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.compute_advantage_in_trainer
OPMDAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.can_balance_batch
OPMDAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.schema
OPMDAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.default_config
AsymREAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm
AsymREAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.use_critic
AsymREAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.use_reference
AsymREAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.compute_advantage_in_trainer
AsymREAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.can_balance_batch
AsymREAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.schema
AsymREAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.default_config
DPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm
DPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.use_critic
DPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.use_reference
DPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.compute_advantage_in_trainer
DPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.can_balance_batch
DPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.schema
DPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.default_config
DPOAlgorithm.check_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.check_config
TOPRAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm
TOPRAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.use_critic
TOPRAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.use_reference
TOPRAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.compute_advantage_in_trainer
TOPRAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.can_balance_batch
TOPRAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.schema
TOPRAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.default_config
CISPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm
CISPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.use_critic
CISPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.use_reference
CISPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.compute_advantage_in_trainer
CISPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.can_balance_batch
CISPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.schema
CISPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.default_config
GSPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm
GSPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.use_critic
GSPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.use_reference
GSPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.compute_advantage_in_trainer
GSPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.can_balance_batch
GSPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.schema
GSPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.default_config
SAPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm
SAPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.use_critic
SAPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.use_reference
SAPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.compute_advantage_in_trainer
SAPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.can_balance_batch
SAPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.schema
SAPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.default_config
MIXAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm
MIXAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.use_critic
MIXAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.use_reference
MIXAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.compute_advantage_in_trainer
MIXAlgorithm.use_rollouthttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.use_rollout
MIXAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.can_balance_batch
MIXAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.schema
MIXAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.default_config
MIXCHORDAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm
MIXCHORDAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.use_critic
MIXCHORDAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.use_reference
MIXCHORDAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.compute_advantage_in_trainer
MIXCHORDAlgorithm.use_rollouthttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.use_rollout
MIXCHORDAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.can_balance_batch
MIXCHORDAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.schema
MIXCHORDAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.default_config
RAFTAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm
RAFTAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.use_critic
RAFTAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.use_reference
RAFTAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.compute_advantage_in_trainer
RAFTAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.can_balance_batch
RAFTAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.schema
RAFTAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.default_config
sPPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm
sPPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.use_critic
sPPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.use_reference
sPPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.compute_advantage_in_trainer
sPPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.can_balance_batch
sPPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.schema
sPPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.default_config
RECAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm
RECAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.use_critic
RECAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.use_reference
RECAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.compute_advantage_in_trainer
RECAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.can_balance_batch
RECAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.schema
RECAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.default_config
MultiStepGRPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm
MultiStepGRPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.use_critic
MultiStepGRPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.use_reference
MultiStepGRPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.compute_advantage_in_trainer
MultiStepGRPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.can_balance_batch
MultiStepGRPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.schema
MultiStepGRPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.default_config
OnPolicyDistillAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm
OnPolicyDistillAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.use_critic
OnPolicyDistillAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.use_reference
OnPolicyDistillAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.compute_advantage_in_trainer
OnPolicyDistillAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.can_balance_batch
OnPolicyDistillAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.schema
OnPolicyDistillAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#module-trinity.algorithm.algorithm
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#ConstantMeta
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ConstantMeta
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#AlgorithmType.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#AlgorithmType.name
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.name
Confighttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.config.html#trinity.common.config.Config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#AlgorithmType.check_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.check_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#SFTAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#SFTAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#PPOAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#PPOAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#GRPOAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#GRPOAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#ReinforcePlusPlusAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#ReinforcePlusPlusAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#RLOOAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#RLOOAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#OPMDAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#OPMDAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#AsymREAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#AsymREAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#DPOAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#DPOAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.default_config
Confighttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.common.config.html#trinity.common.config.Config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#DPOAlgorithm.check_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.check_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#TOPRAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
https://arxiv.org/pdf/2503.14286v1https://arxiv.org/pdf/2503.14286v1
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#TOPRAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#CISPOAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
https://arxiv.org/abs/2506.13585https://arxiv.org/abs/2506.13585
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#CISPOAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#GSPOAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
https://arxiv.org/pdf/2507.18071https://arxiv.org/pdf/2507.18071
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#GSPOAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#SAPOAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#SAPOAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#MIXAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.use_rollout
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#MIXAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#MIXCHORDAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.use_rollout
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#MIXCHORDAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#RAFTAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#RAFTAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#sPPOAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#sPPOAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#RECAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#RECAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#MultiStepGRPOAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#MultiStepGRPOAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.default_config
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#OnPolicyDistillAlgorithm
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.use_critic
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.use_reference
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.compute_advantage_in_trainer
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.can_balance_batch
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.schema
[source]https://modelscope.github.io/Trinity-RFT/en/main/_modules/trinity/algorithm/algorithm.html#OnPolicyDistillAlgorithm.default_config
#https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.default_config
previous trinity.algorithm.sample_strategy.utils module https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.sample_strategy.utils.html
next trinity.algorithm.key_mapper module https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.key_mapper.html
ConstantMetahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ConstantMeta
AlgorithmTypehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType
AlgorithmType.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.use_critic
AlgorithmType.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.use_reference
AlgorithmType.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.compute_advantage_in_trainer
AlgorithmType.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.can_balance_batch
AlgorithmType.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.schema
AlgorithmType.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.default_config
AlgorithmType.name()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.name
AlgorithmType.check_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AlgorithmType.check_config
SFTAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm
SFTAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.use_critic
SFTAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.use_reference
SFTAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.compute_advantage_in_trainer
SFTAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.can_balance_batch
SFTAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.schema
SFTAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SFTAlgorithm.default_config
PPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm
PPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.use_critic
PPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.use_reference
PPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.compute_advantage_in_trainer
PPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.can_balance_batch
PPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.schema
PPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.PPOAlgorithm.default_config
GRPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm
GRPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.use_critic
GRPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.use_reference
GRPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.compute_advantage_in_trainer
GRPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.can_balance_batch
GRPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.schema
GRPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GRPOAlgorithm.default_config
ReinforcePlusPlusAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm
ReinforcePlusPlusAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.use_critic
ReinforcePlusPlusAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.use_reference
ReinforcePlusPlusAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.compute_advantage_in_trainer
ReinforcePlusPlusAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.can_balance_batch
ReinforcePlusPlusAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.schema
ReinforcePlusPlusAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.ReinforcePlusPlusAlgorithm.default_config
RLOOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm
RLOOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.use_critic
RLOOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.use_reference
RLOOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.compute_advantage_in_trainer
RLOOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.can_balance_batch
RLOOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.schema
RLOOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RLOOAlgorithm.default_config
OPMDAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm
OPMDAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.use_critic
OPMDAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.use_reference
OPMDAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.compute_advantage_in_trainer
OPMDAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.can_balance_batch
OPMDAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.schema
OPMDAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OPMDAlgorithm.default_config
AsymREAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm
AsymREAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.use_critic
AsymREAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.use_reference
AsymREAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.compute_advantage_in_trainer
AsymREAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.can_balance_batch
AsymREAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.schema
AsymREAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.AsymREAlgorithm.default_config
DPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm
DPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.use_critic
DPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.use_reference
DPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.compute_advantage_in_trainer
DPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.can_balance_batch
DPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.schema
DPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.default_config
DPOAlgorithm.check_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.DPOAlgorithm.check_config
TOPRAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm
TOPRAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.use_critic
TOPRAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.use_reference
TOPRAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.compute_advantage_in_trainer
TOPRAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.can_balance_batch
TOPRAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.schema
TOPRAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.TOPRAlgorithm.default_config
CISPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm
CISPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.use_critic
CISPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.use_reference
CISPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.compute_advantage_in_trainer
CISPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.can_balance_batch
CISPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.schema
CISPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.CISPOAlgorithm.default_config
GSPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm
GSPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.use_critic
GSPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.use_reference
GSPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.compute_advantage_in_trainer
GSPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.can_balance_batch
GSPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.schema
GSPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.GSPOAlgorithm.default_config
SAPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm
SAPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.use_critic
SAPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.use_reference
SAPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.compute_advantage_in_trainer
SAPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.can_balance_batch
SAPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.schema
SAPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.SAPOAlgorithm.default_config
MIXAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm
MIXAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.use_critic
MIXAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.use_reference
MIXAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.compute_advantage_in_trainer
MIXAlgorithm.use_rollouthttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.use_rollout
MIXAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.can_balance_batch
MIXAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.schema
MIXAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXAlgorithm.default_config
MIXCHORDAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm
MIXCHORDAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.use_critic
MIXCHORDAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.use_reference
MIXCHORDAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.compute_advantage_in_trainer
MIXCHORDAlgorithm.use_rollouthttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.use_rollout
MIXCHORDAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.can_balance_batch
MIXCHORDAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.schema
MIXCHORDAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MIXCHORDAlgorithm.default_config
RAFTAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm
RAFTAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.use_critic
RAFTAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.use_reference
RAFTAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.compute_advantage_in_trainer
RAFTAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.can_balance_batch
RAFTAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.schema
RAFTAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RAFTAlgorithm.default_config
sPPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm
sPPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.use_critic
sPPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.use_reference
sPPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.compute_advantage_in_trainer
sPPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.can_balance_batch
sPPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.schema
sPPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.sPPOAlgorithm.default_config
RECAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm
RECAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.use_critic
RECAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.use_reference
RECAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.compute_advantage_in_trainer
RECAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.can_balance_batch
RECAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.schema
RECAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.RECAlgorithm.default_config
MultiStepGRPOAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm
MultiStepGRPOAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.use_critic
MultiStepGRPOAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.use_reference
MultiStepGRPOAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.compute_advantage_in_trainer
MultiStepGRPOAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.can_balance_batch
MultiStepGRPOAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.schema
MultiStepGRPOAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.MultiStepGRPOAlgorithm.default_config
OnPolicyDistillAlgorithmhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm
OnPolicyDistillAlgorithm.use_critichttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.use_critic
OnPolicyDistillAlgorithm.use_referencehttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.use_reference
OnPolicyDistillAlgorithm.compute_advantage_in_trainerhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.compute_advantage_in_trainer
OnPolicyDistillAlgorithm.can_balance_batchhttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.can_balance_batch
OnPolicyDistillAlgorithm.schemahttps://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.schema
OnPolicyDistillAlgorithm.default_config()https://modelscope.github.io/Trinity-RFT/en/main/build_api/trinity.algorithm.algorithm.html#trinity.algorithm.algorithm.OnPolicyDistillAlgorithm.default_config

Viewport: width=device-width, initial-scale=1


URLs of crawlers that visited me.