mirror of
https://github.com/OneUptime/oneuptime.git
synced 2026-04-06 00:32:12 +02:00
Llama
Prepare
- Download the models from Meta.
- Once the models are downloaded, place them in the Llama/Models folder. Please make sure you also place tokenizer.model and tokenizer_checklist.chk in the same folder.
- Edit Dockerfile to include the model name in the MODEL_NAME variable.
- Docker build
docker build -t llama . -f ./Llama/Dockerfile
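Before building or running the image, it can help to confirm that the tokenizer files named in the Prepare steps are actually in the Models folder. The following is a minimal sketch, not part of this repository: the function name missing_model_files is hypothetical, and it only checks the two file names mentioned above (the model weights themselves are not checked, since their names depend on the model you downloaded).

```python
from pathlib import Path

# File names taken from the Prepare steps above.
REQUIRED_FILES = ("tokenizer.model", "tokenizer_checklist.chk")


def missing_model_files(models_dir="Llama/Models"):
    """Return the required file names that are absent from models_dir.

    Hypothetical helper for illustration; not part of the project.
    """
    folder = Path(models_dir)
    return [name for name in REQUIRED_FILES if not (folder / name).is_file()]


if __name__ == "__main__":
    missing = missing_model_files()
    if missing:
        print("Missing from Llama/Models:", ", ".join(missing))
    else:
        print("Tokenizer files found.")
```

Run it from the repository root so that the default Llama/Models path resolves correctly.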
Run
For Linux
docker run --gpus all -p 8547:8547 -it -v ./Llama/Models:/app/Models llama
For macOS
docker run -p 8547:8547 -it -v ./Llama/Models:/app/Models llama
Run without a Docker container
uvicorn app:app --host 0.0.0.0 --port 8547
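Whichever way the server is started, a quick way to see whether it is up is to test that something accepts TCP connections on port 8547. The sketch below assumes the server is reachable on 127.0.0.1; the helper name is_port_open is hypothetical. It only checks that the port is open and makes no assumption about the HTTP routes that app.py exposes.

```python
import socket


def is_port_open(host="127.0.0.1", port=8547, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper for illustration; not part of the project.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, unreachable host, and timeouts.
        return False


if __name__ == "__main__":
    print("listening" if is_port_open() else "not reachable")
```

Loading a model can take a while, so a "not reachable" result shortly after startup does not necessarily mean the server failed.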