test_tiny_mixtral_only_router

test_tiny_mixtral_only_router is a Mixture of Experts (MoE) made with the following models using a modified version of mergekit.

🧩 Configuration

base_model: openaccess-ai-collective/tiny-mistral
gate_mode: hidden
dtype: bfloat16
experts:
  - source_model: openaccess-ai-collective/tiny-mistral
    positive_prompts:
      - "math"
    # You can add negative_prompts if needed
  - source_model: openaccess-ai-collective/tiny-mistral
    positive_prompts:
      - "science"
  - source_model: openaccess-ai-collective/tiny-mistral
    positive_prompts:
      - "writing"
    # You can add negative_prompts if needed
  - source_model: openaccess-ai-collective/tiny-mistral
    positive_prompts:
      - "general"

This is a test version of arcee-ai's hidden-state routing approach. The model is only the router for a frankenMoE rather than the entire MoE itself.
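
For reference, `gate_mode: hidden` initializes each expert's routing vector from the base model's hidden states for that expert's positive prompts, rather than training the router. The sketch below is a rough illustration of the idea, not the author's exact code: stock mergekit computes one gate per MoE layer, while this sketch uses only the final hidden layer for brevity, with the prompt lists copied from the config above.

```python
# Sketch of hidden-state gate initialization (assumed to mirror mergekit's
# gate_mode: hidden): one routing vector per expert, taken from the base
# model's hidden states for that expert's positive prompts.
import torch
from transformers import AutoModel, AutoTokenizer

base = "openaccess-ai-collective/tiny-mistral"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModel.from_pretrained(base, torch_dtype=torch.bfloat16)  # matches the config's dtype

def prompt_hidden(prompt: str) -> torch.Tensor:
    """Mean final-layer hidden state for one prompt."""
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        out = model(**inputs, output_hidden_states=True)
    return out.hidden_states[-1].mean(dim=1).squeeze(0)

# Positive prompts per expert, from the config above.
expert_prompts = [["math"], ["science"], ["writing"], ["general"]]
gate_weight = torch.stack(
    [torch.stack([prompt_hidden(p) for p in ps]).mean(dim=0) for ps in expert_prompts]
)
print(gate_weight.shape)  # (num_experts, hidden_size)
```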
