Models

This Langflow feature is currently in public preview. Development is ongoing, and the features and functionality are subject to change. Langflow, and the use of such, is subject to the DataStax Preview Terms.

Model components generate text using large language models.

Refer to your specific component’s documentation for more information on parameters.

Use a model component in a flow

Model components receive inputs and prompts for generating text, and the generated text is sent to an output component.

The model output can also be sent to the Language Model port and on to a Parse Data component, where the output can be parsed into structured Data objects.

This example has the OpenAI model in a chatbot flow. For more information, see the Quickstart.

AI/ML API

This component creates a ChatOpenAI model instance using the AIML API.

For more information, see AIML documentation.

Parameters

Inputs
Name	Type	Description
max_tokens	Integer	The maximum number of tokens to generate. Set to 0 for unlimited tokens. Range: 0-128000.
model_kwargs	Dictionary	Additional keyword arguments for the model.
model_name	String	The name of the AIML model to use. Options are predefined in AIML_CHAT_MODELS.
aiml_api_base	String	The base URL of the AIML API. Defaults to https://api.aimlapi.com.
api_key	SecretString	The AIML API Key to use for the model.
temperature	Float	Controls randomness in the output. Default: 0.1.
seed	Integer	Controls reproducibility of the job.

Outputs
Name	Type	Description
model	LanguageModel	An instance of ChatOpenAI configured with the specified parameters.

Name	Type	Description
model_id	String	The ID of the Amazon Bedrock model to use. Options include various models from Amazon, Anthropic, AI21, Cohere, Meta, Mistral, and Stability AI.
aws_access_key	SecretString	AWS Access Key for authentication.
aws_secret_key	SecretString	AWS Secret Key for authentication.
credentials_profile_name	String	Name of the AWS credentials profile to use (advanced).
region_name	String	AWS region name. Default: "us-east-1".
model_kwargs	Dictionary	Additional keyword arguments for the model (advanced).
endpoint_url	String	Custom endpoint URL for the Bedrock service (advanced).

Name	Display Name	Info
Model Name	Model Name	Specifies the name of the Azure OpenAI model to be used for text generation.
Azure Endpoint	Azure Endpoint	Your Azure endpoint, including the resource.
Deployment Name	Deployment Name	Specifies the name of the deployment.
API Version	API Version	Specifies the version of the Azure OpenAI API to be used.
API Key	API Key	Your Azure OpenAI API key.
Temperature	Temperature	Specifies the sampling temperature. Defaults to `0.7`.
Max Tokens	Max Tokens	Specifies the maximum number of tokens to generate. Defaults to `1000`.
Input Value	Input Value	Specifies the input text for text generation.
Stream	Stream	Specifies whether to stream the response from the model. Defaults to `False`.

Name	Display Name	Info
Cohere API Key	Cohere API Key	Your Cohere API key.
Max Tokens	Max Tokens	Specifies the maximum number of tokens to generate. Defaults to `256`.
Temperature	Temperature	Specifies the sampling temperature. Defaults to `0.75`.
Input Value	Input Value	Specifies the input text for text generation.

Name	Type	Description
max_tokens	Integer	Maximum number of tokens to generate. Set to 0 for unlimited. Range: 0-128000.
model_kwargs	Dictionary	Additional keyword arguments for the model.
json_mode	Boolean	If True, outputs JSON regardless of passing a schema.
model_name	String	The DeepSeek model to use. Default: deepseek-chat.
api_base	String	Base URL for API requests. Default: `https://api.deepseek.com`.
api_key	SecretString	Your DeepSeek API key for authentication.
temperature	Float	Controls randomness in responses. Range: [0.0, 2.0]. Default: 1.0.
seed	Integer	Number initialized for random number generation. Use the same seed integer for more reproducible results, and use a different seed number for more random results.

Name	Display Name	Info
Google API Key	Google API Key	Your Google API key to use for the Google Generative AI.
Model	Model	The name of the model to use, such as `"gemini-pro"`.
Max Output Tokens	Max Output Tokens	The maximum number of tokens to generate.
Temperature	Temperature	Run inference with this temperature.
Top K	Top K	Consider the set of top K most probable tokens.
Top P	Top P	The maximum cumulative probability of tokens to consider when sampling.
N	N	Number of chat completions to generate for each prompt.

Name	Type	Description
groq_api_key	SecretString	API key for the Groq API.
groq_api_base	String	Base URL path for API requests. Default: "https://api.groq.com" (advanced).
max_tokens	Integer	The maximum number of tokens to generate (advanced).
temperature	Float	Controls randomness in the output. Range: [0.0, 1.0]. Default: 0.1.
n	Integer	Number of chat completions to generate for each prompt (advanced).
model_name	String	The name of the Groq model to use. Options are dynamically fetched from the Groq API.

Name	Display Name	Info
model_id	String	The model ID from Hugging Face Hub. For example, "gpt2", "facebook/bart-large".
huggingfacehub_api_token	SecretString	Your Hugging Face API token for authentication.
temperature	Float	Controls randomness in the output. Range: [0.0, 1.0]. Default: 0.7.
max_new_tokens	Integer	Maximum number of tokens to generate. Default: 512.
top_p	Float	Nucleus sampling parameter. Range: [0.0, 1.0]. Default: 0.95.
top_k	Integer	Top-k sampling parameter. Default: 50.
model_kwargs	Dictionary	Additional keyword arguments to pass to the model.

Name	Type	Description
url	String	The base URL of the watsonx API.
project_id	String	Your watsonx Project ID.
api_key	SecretString	Your IBM watsonx API Key.
model_name	String	The name of the watsonx model to use. Options are dynamically fetched from the API.
max_tokens	Integer	The maximum number of tokens to generate. Default: `1000`.
stop_sequence	String	The sequence where generation should stop.
temperature	Float	Controls randomness in the output. Default: `0.1`.
top_p	Float	Controls nucleus sampling, which limits the model to tokens whose probability is below the `top_p` value. Range: Default: `0.9`.
frequency_penalty	Float	Controls frequency penalty. A positive value decreases the probability of repeating tokens, and a negative value increases the probability. Range: Default: `0.5`.
presence_penalty	Float	Controls presence penalty. A positive value increases the likelihood of new topics being introduced. Default: `0.3`.
seed	Integer	A random seed for the model. Default: `8`.
logprobs	Boolean	Whether to return log probabilities of output tokens or not. Default: `True`.
top_logprobs	Integer	The number of most likely tokens to return at each position. Default: `3`.
logit_bias	String	A JSON string of token IDs to bias or suppress.

Name	Type	Description
provider	String	The model provider to use. Options: "OpenAI", "Anthropic". Default: "OpenAI".
model_name	String	The name of the model to use. Options depend on the selected provider.
api_key	SecretString	The API Key for authentication with the selected provider.
input_value	String	The input text to send to the model.
system_message	String	A system message that helps set the behavior of the assistant.
stream	Boolean	Whether to stream the response. Default: `False`.
temperature	Float	Controls randomness in responses. Range: `[0.0, 1.0]`. Default: `0.1`.

Name	Type	Description
base_url	String	The URL where LM Studio is running. Default: "http://localhost:1234".
max_tokens	Integer	Maximum number of tokens to generate in the response. Default: 512.
temperature	Float	Controls randomness in the output. Range: [0.0, 2.0]. Default: 0.7.
top_p	Float	Controls diversity via nucleus sampling. Range: [0.0, 1.0]. Default: 1.0.
stop	List[String]	List of strings that stop generation when encountered (advanced).
stream	Boolean	Whether to stream the response. Default: False.
presence_penalty	Float	Penalizes repeated tokens. Range: [-2.0, 2.0]. Default: 0.0.
frequency_penalty	Float	Penalizes frequent tokens. Range: [-2.0, 2.0]. Default: 0.0.

Name	Type	Description
api_key	SecretString	Your Novita AI API Key.
model	String	The id of the Novita AI model to use.
max_tokens	Integer	The maximum number of tokens to generate. Set to 0 for unlimited tokens.
temperature	Float	Controls randomness in the output. Range: [0.0, 1.0]. Default: 0.7.
top_p	Float	Controls the nucleus sampling. Range: [0.0, 1.0]. Default: 1.0.
frequency_penalty	Float	Controls the frequency penalty. Range: [0.0, 2.0]. Default: 0.0.
presence_penalty	Float	Controls the presence penalty. Range: [0.0, 2.0]. Default: 0.0.

Name	Display Name	Info
Base URL	Base URL	Endpoint of the Ollama API.
Model Name	Model Name	The model name to use.
Temperature	Temperature	Controls the creativity of model responses.

Name	Display Name	Info
max_tokens	Max Tokens	Maximum number of tokens to generate
model_kwargs	Model Kwargs	Additional keyword arguments for the model
json_mode	JSON Mode	Enable JSON output mode
output_schema	Schema	Schema for the model’s output
model_name	Model Name	Name of the OpenAI model to use
openai_api_base	OpenAI API Base	Base URL for the OpenAI API
api_key	OpenAI API Key	API key for authentication
temperature	Temperature	Controls randomness in output
seed	Seed	Seed for reproducibility

Name	Type	Description
api_key	SecretString	Your OpenRouter API key for authentication.
site_url	String	Your site URL for OpenRouter rankings (advanced).
app_name	String	Your app name for OpenRouter rankings (advanced).
provider	String	The AI model provider to use.
model_name	String	The specific model to use for chat completion.
temperature	Float	Controls randomness in the output. Range: [0.0, 2.0]. Default: 0.7.
max_tokens	Integer	The maximum number of tokens to generate (advanced).

Name	Type	Description
model_name	String	The name of the Perplexity model to use. Options include various Llama 3.1 models.
max_output_tokens	Integer	The maximum number of tokens to generate.
api_key	SecretString	The Perplexity API Key for authentication.
temperature	Float	Controls randomness in the output. Default: 0.75.
top_p	Float	The maximum cumulative probability of tokens to consider when sampling (advanced).
n	Integer	Number of chat completions to generate for each prompt (advanced).
top_k	Integer	Number of top tokens to consider for top-k sampling. Must be positive (advanced).

Name	Display Name	Info
sambanova_url	SambaNova URL	Base URL path for API requests. The default is `https://api.sambanova.ai/v1/chat/completions`.
sambanova_api_key	SambaNova API Key	Your SambaNova API Key.
model_name	Model Name	The name of the Sambanova model to use.
max_tokens	Max Tokens	The maximum number of tokens to generate. Set to `0` for unlimited tokens.
temperature	Temperature	Controls randomness in the output. The default value is `0.07`.

Name	Type	Description
credentials	File	JSON credentials file. Leave empty to fallback to environment variables. File type: JSON.
model_name	String	The name of the Vertex AI model to use. Default: "gemini-1.5-pro".
project	String	The project ID (advanced).
location	String	The location for the Vertex AI API. Default: "us-central1" (advanced).
max_output_tokens	Integer	The maximum number of tokens to generate (advanced).
max_retries	Integer	Maximum number of retries for API calls. Default: 1 (advanced).
temperature	Float	Controls randomness in the output. Default: 0.0.
top_k	Integer	The number of highest probability vocabulary tokens to keep for top-k-filtering (advanced).
top_p	Float	The cumulative probability of parameter highest probability vocabulary tokens to keep for nucleus sampling. Default: 0.95 (advanced).
verbose	Boolean	Whether to print verbose output. Default: False (advanced).

Models

Was this helpful?