@@ -198,7 +198,7 @@ Alright, let's dive into what each key means in the yaml config above:
  - `max_tokens` (Integer): Controls how many tokens are used in the response.
  - `top_p` (Float): Controls the diversity of word selection. A higher value (closer to 1) makes word selection more diverse.
  - `stream` (Boolean): Controls if the response is streamed back to the user (set to false).
- - `prompt` (String): A prompt for the model to follow when generating responses, requires $context and $query variables.
+ - `prompt` (String): A prompt for the model to follow when generating responses, requires `$context` and `$query` variables.
  - `system_prompt` (String): A system prompt for the model to follow when generating responses, in this case, it's set to the style of William Shakespeare.
  - `stream` (Boolean): Controls if the response is streamed back to the user (set to false).
  - `number_documents` (Integer): Number of documents to pull from the vectordb as context, defaults to 1
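
The `prompt` key above requires both `$context` and `$query` placeholders. As a rough illustration of what that requirement means, the sketch below checks a prompt string for the two placeholders and fills them in with Python's standard `string.Template`. The helper names `missing_prompt_vars` and `render_prompt` are hypothetical, and using `string.Template` is an assumption made only because its `$name` syntax matches; the library may perform the substitution differently.

```python
from string import Template

# Placeholders the `prompt` key is documented to require.
REQUIRED_VARS = ("context", "query")


def missing_prompt_vars(prompt: str) -> list:
    """Return the required placeholders that do not appear in the prompt."""
    found = set()
    # Template.pattern matches both the $name and ${name} forms.
    for match in Template.pattern.finditer(prompt):
        name = match.group("named") or match.group("braced")
        if name:
            found.add(name)
    return [var for var in REQUIRED_VARS if var not in found]


def render_prompt(prompt: str, context: str, query: str) -> str:
    """Fill $context and $query, failing early if either is missing."""
    missing = missing_prompt_vars(prompt)
    if missing:
        raise ValueError("prompt is missing placeholders: " + ", ".join(missing))
    return Template(prompt).substitute(context=context, query=query)


if __name__ == "__main__":
    example = "Use the context to answer.\nContext: $context\nQuery: $query"
    print(render_prompt(example, "Retrieved document text", "What is top_p?"))
```

Checking for the placeholders before substitution gives a clear error for a misconfigured prompt instead of silently sending the model a prompt with no retrieved context.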