GGUF
conversational
Philip Monk commited on
Commit
603395c
·
1 Parent(s): 80fc8d1

initial commit

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. README.md +30 -0
  3. rnj-1-8B-instruct-Q4_K_M.gguf +3 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ *.gguf filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,33 @@
1
  ---
2
  license: apache-2.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
  ---
4
+ This is a GGUF-formatted checkpoint of
5
+ [rnj-1-instruct](https://huggingface.co/EssentialAI/rnj-1-instruct) suitable
6
+ for use in llama.cpp. This has been quantized with the Q4\_K\_M scheme, which
7
+ results in model weights of size 4.8GB.
8
+
9
+ Support for this model has been PR'd upstream, but in the meantime you can
10
+ build from source with these instructions for MacOS. For Linux, install cmake
11
+ using your package manager. For Windows, consult the llama.cpp [build
12
+ guide](https://github.com/ggml-org/llama.cpp/blob/master/docs/build.md).
13
+
14
+ ```bash
15
+ brew install cmake
16
+ git clone https://github.com/Essential-AI/llama.cpp
17
+ cd llama.cpp
18
+ git checkout rnj-1
19
+ cmake -B build
20
+ cmake --build build --config Release
21
+ ```
22
+
23
+ To run this model in a web UI, use this command and open your browser to http://localhost:8080:
24
+
25
+ ```
26
+ build/bin/llama-server -hf EssentialAI/rnj-1-instruct-GGUF -c 0
27
+ ```
28
+
29
+ To run it in the CLI, use this command:
30
+
31
+ ```
32
+ build/bin/llama-cli -hf EssentialAI/rnj-1-instruct-GGUF
33
+ ```
rnj-1-8B-instruct-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:89b8e7e212062029fb9fe3cec95a9eeaeeddde63d3b4c728107afa1eccf28e4f
3
+ size 5113914272