Commit graph

1775 commits

Author SHA1 Message Date
Jannik Streidl
fdef2abdfb cuda fix 2024-03-22 12:48:48 +01:00
Jannik Streidl
fc4e762b05 ENV fix 2024-03-22 10:57:41 +01:00
Jannik Streidl
c08631d6ff updates docker-build action 2024-03-22 10:05:20 +01:00
Jannik Streidl
3f973fe77f i need coffee 2024-03-22 09:57:17 +01:00
Jannik Streidl
953d05428e grammar 2024-03-22 09:55:46 +01:00
Jannik Streidl
6dc4b748fb added comments 2024-03-22 09:50:01 +01:00
Jannik Streidl
fde0139bf7 All in one Dockerfile for including Ollama 2024-03-22 09:31:35 +01:00
Timothy J. Baek
afa591afb2 Create Dockerfile-ollama 2024-03-20 20:58:23 -07:00
Joseph Young
2588da0e27 Update PyTorch wheel source to CUDA 11.8
Modified the Dockerfile to install PyTorch, torchvision, and torchaudio from a CUDA 11.8 specific wheel URL. This ensures compatibility with the CUDA version in our environment and potentially improves performance and stability for GPU-accelerated operations.
2024-03-20 18:33:34 -04:00
Joseph Young
8ce48dc7d1 Fix typo in Dockerfile comment for model recommendation
Okay, this was driving my OCD crazy.

Corrected a spelling error in the Dockerfile's comment section to enhance documentation clarity. The typo 'persormance' was updated to 'performance,' ensuring accurate guidance on using multilingual sentence transformer models for better performance and language support.
2024-03-20 18:28:57 -04:00
Joseph Young
9bea40bd40 Adding the missing env variable
ENV RAG_EMBEDDING_MODEL_DEVICE_TYPE="cuda"
2024-03-20 18:16:11 -04:00
Jannik Streidl
9a8a48b879 gh build action for the different build args 2024-03-20 09:05:38 +01:00
Jannik Streidl
1f6739337b docker improvements & changed universal device type env for different models used 2024-03-20 08:44:09 +01:00
Jannik Streidl
132d741c55 set default to cpu 2024-03-18 17:09:43 +01:00
Jannik Streidl
5abe0089cb cuda support 2024-03-18 17:08:34 +01:00
Joseph Young
c5948d3e2c Updated Dockerfile for CUDA backend
Enabled NVIDIA CUDA backend build stage in the Dockerfile for enhanced performance with GPU support. Moved the environment variable defining the device type for the embedding and TTS models to be shared between CPU and GPU configurations. The default device type for CPU build is now explicitly set to "cpu", while the CUDA build retains "cuda", ensuring clarity and performance optimization across different hardware setups.
2024-03-17 22:50:52 -04:00
Joseph Young
c004ecdccc Refactor Dockerfile for CPU and CUDA builds
Switched to Chainguard images as base for both CPU and CUDA backend builds for improved security and compatibility. Replaced Ubuntu base with Chainguard's Python image for the CPU builds and PyTorch CUDA image for GPU acceleration, resolving python requirements conflicts. Updated package installation commands to align with the new Redhat-compatible base images. The Dockerfile now installs only the necessary dependencies, as Python is provided by the base image.

These changes will facilitate a more secure and streamlined build process with better dependency management across different platforms.
2024-03-17 17:03:43 -04:00
Joseph Young
e3b1cbbb86 Parametrize CUDA_VERSION in Dockerfile
Standardized CUDA_VERSION as a global ARG to ensure consistency and facilitate version updates across the Dockerfile. This change allows the CUDA version to be defined once at the beginning and reused, reducing the chance of mismatched versions and easing maintenance when changing CUDA versions. It further streamlines the build process for potential multi-stage builds with varying CUDA dependencies.

Refs #nvidia-update
2024-03-17 02:27:06 -04:00
Joseph Young
f6cef312f2 Optimize Dockerfile for CUDA support
Refactored the Dockerfile to better organize and streamline environment variable settings, emphasizing support for a CUDA-based WebUI backend while retaining the ability to build a CPU-only image. Consolidated ENV commands to reduce layers, improving build efficiency, and set a default PORT environment to enhance container usability. Enabled exposure of the backend service on port 8080 and leveraged combined RUN directives to minimize the image footprint. These changes facilitate a more robust deployment process, catering to both CPU and CUDA environments.
2024-03-17 01:55:37 -04:00
Joseph Young
75a40dead6
Create Dockerfile-cuda
+Dockerfile-cuda

I created this file to help add CUDA support to open-webui for access to a GPU during embedding operations.
2024-03-16 19:26:21 -04:00
Jannik S
29e48b1c1f
Exposed port 8080 2024-03-16 20:11:09 +01:00
Jannik S
62ab163316
Update Dockerfile 2024-03-16 12:43:48 +01:00
Jannik Streidl
384b7e8462 changed from bullseye to bookworm + removed unused steps 2024-03-14 11:33:54 +01:00
Jannik Streidl
50bec32153 Dockerfile optimisation 2024-03-14 11:18:27 +01:00
Timothy Jaeryang Baek
7ae4669f35
Merge pull request #1131 from open-webui/dev
fix: env var migration issue
2024-03-10 21:29:20 -05:00
Timothy J. Baek
e3fc97241d fix: env var migration issue 2024-03-10 19:26:06 -07:00
Timothy J. Baek
b4c770d74b fix: disable openai model by default 2024-03-10 19:00:43 -07:00
Timothy Jaeryang Baek
11ca2703b0
Merge pull request #1130 from open-webui/dev
fix: rag
2024-03-10 20:41:58 -05:00
Timothy J. Baek
8df6b137cb fix: rag 2024-03-10 18:40:50 -07:00
Timothy Jaeryang Baek
89634046e7
Merge pull request #1107 from open-webui/dev
0.1.111
2024-03-10 17:00:56 -05:00
Timothy J. Baek
88d324b52d doc: changelog 2024-03-10 14:55:17 -07:00
Timothy J. Baek
64afde6c0e refac: config versioning 2024-03-10 14:15:32 -07:00
Timothy J. Baek
1e9d40af42 feat: update all models 2024-03-10 14:10:32 -07:00
Timothy J. Baek
98948814fd feat: toggle pdf ocr 2024-03-10 13:32:34 -07:00
Timothy J. Baek
96ada23272 Update requirements.txt 2024-03-10 00:00:16 -08:00
Timothy J. Baek
a02ac67b5c Update requirements.txt 2024-03-09 23:59:17 -08:00
Timothy J. Baek
350489e600 fix: requirements.txt 2024-03-09 23:49:07 -08:00
Timothy J. Baek
8a08ba0791 refac: settings save 2024-03-09 22:02:27 -08:00
Timothy J. Baek
c6667510c4 refac: confusing icon 2024-03-09 21:55:13 -08:00
Timothy J. Baek
bd84753c6b feat: title auto-generate for openai apis 2024-03-09 21:52:19 -08:00
Timothy J. Baek
a4ca1fc5c4 feat: model filter list env var 2024-03-09 21:47:01 -08:00
Timothy Jaeryang Baek
bcabd3df84
Merge pull request #1117 from open-webui/model-whitelist
feat: model filter (whitelist)
2024-03-10 00:30:43 -05:00
Timothy J. Baek
81daf4ceb9 feat: model filter frontend integration 2024-03-09 21:29:04 -08:00
Timothy J. Baek
b550e23bf6 feat: model filter backend 2024-03-09 21:19:20 -08:00
Timothy J. Baek
83a900bd4a fix: dragged layover 2024-03-09 20:27:54 -08:00
Timothy J. Baek
1a93021cc4 fix: log modified messages 2024-03-09 20:12:32 -08:00
Timothy J. Baek
6d5ff8d469 feat: frontend whitelist model selector 2024-03-09 01:50:31 -08:00
Timothy J. Baek
219466374d feat: model whitelist 2024-03-09 01:43:02 -08:00
Timothy J. Baek
d40edc09e1 feat: hide litellm by default 2024-03-09 00:50:48 -08:00
Timothy Jaeryang Baek
d99114518c
Merge pull request #1113 from open-webui/rag
feat: rag api
2024-03-09 02:22:40 -05:00