Samuele Lorefice | 773203127f | They can now answer | 2024-12-26 20:19:59 +01:00
Samuele Lorefice | 4167c75279 | Added drop of pending updates on bot start, reset command, AnswerChat method, GPU offload, limit on response length, context reduced to 2048, flash attention, 4 parallel decode queues, --keep of the original 810 tokens (which is the starting prompt) | 2024-12-26 03:24:56 +01:00
Samuele Lorefice | b74e5d75e1 | Fixed code, enabled it to also always answer in a private chat | 2024-12-26 01:34:10 +01:00
Samuele Lorefice | 2357c7570c | Added llama.cpp and reworked the code | 2024-12-26 00:35:45 +01:00