16k context variant possible?
#2
by InvictusCreations
First off: crazy good model, can't decide if this or Stheno is my favorite.
However, it generates the longest responses of any Llama 3 finetune I've tried so far (between 600 and 1,000 tokens), so the default maximum context fills up within fewer than 8 responses.
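A rough back-of-envelope check of that claim, assuming Llama 3's default 8,192-token context window and roughly 200 tokens of user input per turn (both figures are my assumptions, not from the thread):

```python
# Estimate how many chat turns fit in the context window.
# Assumptions: 8192-token context (Llama 3 default), ~200 tokens
# of user input per turn on top of each model response.
CONTEXT_TOKENS = 8192
USER_TOKENS_PER_TURN = 200

for response_tokens in (600, 1000):
    tokens_per_turn = response_tokens + USER_TOKENS_PER_TURN
    turns = CONTEXT_TOKENS // tokens_per_turn
    print(f"{response_tokens}-token responses -> ~{turns} turns before the context is full")
```

At 1,000-token responses this works out to about 6 turns, which lines up with the "< 8 responses" observation above.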
Any plans for extending the context to 12k or 16k?
My plan is to finish the Uncensored version first, and then begin work on a 12k or 16k context version after that. Given my hardware limitations, I will do my best.
aifeifei798 changed discussion status to closed