Samuele Lorefice
|
65950e3642
|
Solves context out of bound due to history
|
2024-12-26 04:32:11 +01:00 |
|
Samuele Lorefice
|
4167c75279
|
Added rop of pending updates on bot start, reset command, AnswerChat method, GPU offload, limit to response lenght, context reduced to 2048, flash attention, 4 parallel decode queues, --keep of the original 810 tokens (which is the starting prompt)
|
2024-12-26 03:24:56 +01:00 |
|
Samuele Lorefice
|
2357c7570c
|
Added llama.cpp and reworked the code
|
2024-12-26 00:35:45 +01:00 |
|
Samuele Lorefice
|
c6302112b2
|
Implemented also OpenAI
|
2024-12-25 21:38:26 +01:00 |
|
Samuele Lorefice
|
4b308b762a
|
Added LMStudio client, upgraded to .net 9.0
|
2024-12-25 19:37:14 +01:00 |
|
Samuele Lorefice
|
0ba298b955
|
Base commit
|
2024-12-24 23:08:08 +01:00 |
|