Nanopublications

Matches in Nanopublications for { ?s ?p ?o <https://w3id.org/np/RAMmlNyKev37Gbg7UWMXcFz7n75r_vyq1giyo4LxPpvI0/assertion>. }

Showing all 44 matching items.

arXiv.2508.15790 type Entity assertion.
ChatGPT4o type Workflow assertion.
ChatGPT4oMini type Workflow assertion.
CoTChainOfThought type Workflow assertion.
DPO type Workflow assertion.
DeepSeekR1 type Workflow assertion.
DeepSeekR1DistillQwen14B type Workflow assertion.
GRPO type Workflow assertion.
Gemini20FlashThinking type Workflow assertion.
KGo1 type Workflow assertion.
Llama318BInstruct type Workflow assertion.
OpenO1 type Workflow assertion.
Qwen2514BInstruct type Workflow assertion.
Qwen257BInstruct type Workflow assertion.
ChatGPT4o label "ChatGPT-4o" assertion.
ChatGPT4oMini label "ChatGPT4o-mini" assertion.
CoTChainOfThought label "CoT (Chain-of-Thought)" assertion.
DPO label "DPO (Direct Preference Optimization)" assertion.
DeepSeekR1 label "DeepSeek-R1" assertion.
DeepSeekR1DistillQwen14B label "DeepSeek-R1-Distill-Qwen-14B" assertion.
GRPO label "GRPO" assertion.
Gemini20FlashThinking label "Gemini 2.0 Flash Thinking" assertion.
KGo1 label "KG-o1" assertion.
Llama318BInstruct label "Llama3.1-8B-Instruct" assertion.
OpenO1 label "Open-o1" assertion.
Qwen2514BInstruct label "Qwen2.5-14B-Instruct" assertion.
Qwen257BInstruct label "Qwen2.5-7B-Instruct" assertion.
KGo1 comment "KG-o1 is a four-stage framework designed to enhance the intrinsic multi-hop reasoning abilities of LLMs. It leverages KGs to filter entities, generate logical paths, construct complex QA datasets for supervised fine-tuning (SFT) to simulate long-term thinking, and applies a Self-improved Adaptive DPO strategy to refine the LLMs' reasoning, ultimately improving LLM performance during inference for multi-hop question answering." assertion.
arXiv.2508.15790 describes KGo1 assertion.
arXiv.2508.15790 discusses ChatGPT4o assertion.
arXiv.2508.15790 discusses ChatGPT4oMini assertion.
arXiv.2508.15790 discusses CoTChainOfThought assertion.
arXiv.2508.15790 discusses DPO assertion.
arXiv.2508.15790 discusses DeepSeekR1 assertion.
arXiv.2508.15790 discusses DeepSeekR1DistillQwen14B assertion.
arXiv.2508.15790 discusses GRPO assertion.
arXiv.2508.15790 discusses Gemini20FlashThinking assertion.
arXiv.2508.15790 discusses Llama318BInstruct assertion.
arXiv.2508.15790 discusses OpenO1 assertion.
arXiv.2508.15790 discusses Qwen2514BInstruct assertion.
arXiv.2508.15790 discusses Qwen257BInstruct assertion.
KGo1 subject KGEnhancedLLMInference assertion.
arXiv.2508.15790 title "KG-o1: Enhancing Multi-hop Question Answering in Large Language Models via Knowledge Graph Integration" assertion.
KGo1 hasTopCategory KGEnhancedLLM assertion.
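
The quad pattern at the top of this page, { ?s ?p ?o <…/assertion> }, leaves subject, predicate, and object as variables and fixes only the graph. As a minimal sketch of that matching logic, the Python below copies a handful of the rows above into plain tuples and filters them. The `Quad` type, the `match` helper, and the `urn:example:other-graph` entry are hypothetical names introduced for illustration, and the short names (`KGo1`, `type`, `label`) are used as displayed on this page rather than as full IRIs. It is not the server's implementation.

```python
from typing import List, NamedTuple, Optional


class Quad(NamedTuple):
    s: str  # subject
    p: str  # predicate
    o: str  # object
    g: str  # graph


ASSERTION = (
    "https://w3id.org/np/RAMmlNyKev37Gbg7UWMXcFz7n75r_vyq1giyo4LxPpvI0/assertion"
)

# A small sample of the rows listed above, using the page's short names.
QUADS = [
    Quad("arXiv.2508.15790", "type", "Entity", ASSERTION),
    Quad("KGo1", "type", "Workflow", ASSERTION),
    Quad("KGo1", "label", "KG-o1", ASSERTION),
    Quad("arXiv.2508.15790", "describes", "KGo1", ASSERTION),
    Quad("KGo1", "subject", "KGEnhancedLLMInference", ASSERTION),
    Quad("KGo1", "hasTopCategory", "KGEnhancedLLM", ASSERTION),
    # Hypothetical quad in a different graph, only to show that it is excluded.
    Quad("KGo1", "label", "ignored", "urn:example:other-graph"),
]


def match(
    quads: List[Quad],
    s: Optional[str] = None,
    p: Optional[str] = None,
    o: Optional[str] = None,
    g: Optional[str] = None,
) -> List[Quad]:
    """Return the quads that fit the pattern; None plays the role of a variable."""
    pattern = (s, p, o, g)
    return [
        q
        for q in quads
        if all(want is None or want == got for want, got in zip(pattern, q))
    ]


# Same shape as the page's query: only the graph is fixed.
in_assertion = match(QUADS, g=ASSERTION)

# Narrowing further: everything stated about KGo1 in the assertion graph.
kgo1_facts = match(QUADS, s="KGo1", g=ASSERTION)

for quad in kgo1_facts:
    print(quad.s, quad.p, quad.o)
```

Fixing more positions of the pattern narrows the result in the same way, for example `match(QUADS, p="describes")` to find which entity the paper describes.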