You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+18-5Lines changed: 18 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,28 +17,33 @@ The initial public model identifier is `bge-m3`.
17
17
- Upstream model id: `BAAI/bge-m3`
18
18
- Backend: `sentence-transformers`
19
19
- Device:
20
-
-Apple Silicon macOS: `mps` when available, otherwise CPU
20
+
- macOS:`auto` by default, which selects`mps` when available, otherwise CPU
21
21
- all other current targets: CPU
22
22
- Provisioning: first-run download into a local cache directory
23
23
24
24
The command and HTTP layers are written against an internal backend registry so additional models or inference backends can be added later without changing the user-facing contracts.
25
25
26
26
## Acceleration support
27
27
28
-
Current hardware acceleration support is intentionally limited in `v0.1.2`:
28
+
Current hardware acceleration support in `v0.1.3`:
29
29
30
30
-`aarch64-apple-darwin`:
31
+
- defaults to `auto`
31
32
- uses Apple Metal Performance Shaders (`mps`) automatically when available
32
33
- falls back to CPU if MPS is unavailable
33
34
-`x86_64-apple-darwin`:
34
-
- CPU only
35
+
- defaults to `auto`
36
+
- uses Apple Metal Performance Shaders (`mps`) automatically when available
37
+
- falls back to CPU if MPS is unavailable
35
38
-`x86_64-unknown-linux-gnu`:
36
39
- CPU only
37
40
-`aarch64-unknown-linux-gnu`:
38
41
- CPU only
39
42
-`x86_64-pc-windows-msvc`:
40
43
- CPU only
41
44
45
+
Explicit device override is available on embedding, server, and daemon commands via `--device auto|cpu|mps`.
46
+
42
47
The current release does not expose CUDA, ROCm, DirectML, or Intel GPU acceleration paths yet.
43
48
44
49
## Requirements
@@ -86,7 +91,7 @@ Example response:
86
91
"embeddings": [[0.123, -0.456, 0.789]],
87
92
"runtime": {
88
93
"name": "bitloops-embeddings",
89
-
"version": "0.1.2"
94
+
"version": "0.1.3"
90
95
}
91
96
}
92
97
```
@@ -100,6 +105,13 @@ bitloops-embeddings embed \
100
105
--output ./embedding.json
101
106
```
102
107
108
+
Force CPU or request MPS explicitly:
109
+
110
+
```bash
111
+
bitloops-embeddings embed --model bge-m3 --input "Hello World" --device cpu
0 commit comments