Matches in Nanopublications for { ?s ?p ?o <https://w3id.org/np/RAMmlNyKev37Gbg7UWMXcFz7n75r_vyq1giyo4LxPpvI0/assertion>. }
Showing items 1 to 44 of
44
with 100 items per page.
- arXiv.2508.15790 type Entity assertion.
- ChatGPT4o type Workflow assertion.
- ChatGPT4oMini type Workflow assertion.
- CoTChainOfThought type Workflow assertion.
- DPO type Workflow assertion.
- DeepSeekR1 type Workflow assertion.
- DeepSeekR1DistillQwen14B type Workflow assertion.
- GRPO type Workflow assertion.
- Gemini20FlashThinking type Workflow assertion.
- KGo1 type Workflow assertion.
- Llama318BInstruct type Workflow assertion.
- OpenO1 type Workflow assertion.
- Qwen2514BInstruct type Workflow assertion.
- Qwen257BInstruct type Workflow assertion.
- ChatGPT4o label "ChatGPT-4o" assertion.
- ChatGPT4oMini label "ChatGPT4o-mini" assertion.
- CoTChainOfThought label "CoT (Chain-of-Thought)" assertion.
- DPO label "DPO (Direct Preference Optimization)" assertion.
- DeepSeekR1 label "DeepSeek-R1" assertion.
- DeepSeekR1DistillQwen14B label "DeepSeek-R1-Distill-Qwen-14B" assertion.
- GRPO label "GRPO" assertion.
- Gemini20FlashThinking label "Gemini 2.0 Flash Thinking" assertion.
- KGo1 label "KG-o1" assertion.
- Llama318BInstruct label "Llama3.1-8B-Instruct" assertion.
- OpenO1 label "Open-o1" assertion.
- Qwen2514BInstruct label "Qwen2.5-14B-Instruct" assertion.
- Qwen257BInstruct label "Qwen2.5-7B-Instruct" assertion.
- KGo1 comment "KG-o1 is a four-stage framework designed to enhance the intrinsic multi-hop reasoning abilities of LLMs. It leverages KGs to filter entities, generate logical paths, construct complex QA datasets for supervised fine-tuning (SFT) to simulate long-term thinking, and applies a Self-improved Adaptive DPO strategy to refine the LLMs' reasoning, ultimately improving LLM performance during inference for multi-hop question answering." assertion.
- arXiv.2508.15790 describes KGo1 assertion.
- arXiv.2508.15790 discusses ChatGPT4o assertion.
- arXiv.2508.15790 discusses ChatGPT4oMini assertion.
- arXiv.2508.15790 discusses CoTChainOfThought assertion.
- arXiv.2508.15790 discusses DPO assertion.
- arXiv.2508.15790 discusses DeepSeekR1 assertion.
- arXiv.2508.15790 discusses DeepSeekR1DistillQwen14B assertion.
- arXiv.2508.15790 discusses GRPO assertion.
- arXiv.2508.15790 discusses Gemini20FlashThinking assertion.
- arXiv.2508.15790 discusses Llama318BInstruct assertion.
- arXiv.2508.15790 discusses OpenO1 assertion.
- arXiv.2508.15790 discusses Qwen2514BInstruct assertion.
- arXiv.2508.15790 discusses Qwen257BInstruct assertion.
- KGo1 subject KGEnhancedLLMInference assertion.
- arXiv.2508.15790 title "KG-o1: Enhancing Multi-hop Question Answering in Large Language Models via Knowledge Graph Integration" assertion.
- KGo1 hasTopCategory KGEnhancedLLM assertion.