update

2024-11-16 11:42:29 +08:00 · 2024-11-15 14:59:20 +08:00 · 2024-11-15 14:59:20 +08:00 · 2024-11-15 14:59:20 +08:00 · 2024-11-15 14:59:20 +08:00 · 2024-11-15 14:59:20 +08:00
175 changed files with 3419 additions and 432 deletions
--- a/.github/actions/setup-poetry/action.yml
+++ b/.github/actions/setup-poetry/action.yml
@ -4,7 +4,7 @@ inputs:
  python-version:
    description: Python version to use and the Poetry installed with
    required: true
-    default: '3.10'
+    default: '3.11'
  poetry-version:
    description: Poetry version to set up
    required: true
--- a/.github/pull_request_template.md
+++ b/.github/pull_request_template.md
@ -1,34 +1,32 @@
-# Checklist:
+# Summary
 Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change.
 > [!Tip]
 > Close issue syntax: `Fixes #<issue number>` or `Resolves #<issue number>`, see [documentation](https://docs.github.com/en/issues/tracking-your-work-with-issues/linking-a-pull-request-to-an-issue#linking-a-pull-request-to-an-issue-using-a-keyword) for more details.
 # Screenshots
 <table>
  <tr>
  <td>Before: </td>
  <td>After: </td>
  </tr>
  <tr>
  <td>...</td>
  <td>...</td>
  </tr>
 </table>
 # Checklist
 > [!IMPORTANT]  
 > Please review the checklist below before submitting your pull request.
 - [ ] Please open an issue before creating a PR or link to an existing issue
 - [ ] I have performed a self-review of my own code
 - [ ] I have commented my code, particularly in hard-to-understand areas
 - [ ] I ran `dev/reformat`(backend) and `cd web && npx lint-staged`(frontend) to appease the lint gods
 # Description
 Describe the big picture of your changes here to communicate to the maintainers why we should accept this pull request. If it fixes a bug or resolves a feature request, be sure to link to that issue. Close issue syntax: `Fixes #<issue number>`, see [documentation](https://docs.github.com/en/issues/tracking-your-work-with-issues/linking-a-pull-request-to-an-issue#linking-a-pull-request-to-an-issue-using-a-keyword) for more details.
 Fixes 
 ## Type of Change
 - [ ] Bug fix (non-breaking change which fixes an issue)
 - [ ] New feature (non-breaking change which adds functionality)
 - [ ] Breaking change (fix or feature that would cause existing functionality to not work as expected)
 - [ ] This change requires a documentation update, included: [Dify Document](https://github.com/langgenius/dify-docs)
- [ ] Improvement, including but not limited to code refactoring, performance optimization, and UI/UX improvement
+- [x] I understand that this PR may be closed in case there was no previous discussion or issues. (This doesn't apply to typos!)
- [ ] Dependency upgrade
+- [x] I've added a test for each change that was introduced, and I tried as much as possible to make a single atomic change.
-
+- [x] I've updated the documentation accordingly.
-# Testing Instructions
+- [x] I ran `dev/reformat`(backend) and `cd web && npx lint-staged`(frontend) to appease the lint gods
 Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration
 - [ ] Test A
 - [ ] Test B
--- a/.github/workflows/api-tests.yml
+++ b/.github/workflows/api-tests.yml
@ -20,7 +20,6 @@ jobs:
    strategy:
      matrix:
        python-version:
          - "3.10"
          - "3.11"
          - "3.12"
--- a/.github/workflows/vdb-tests.yml
+++ b/.github/workflows/vdb-tests.yml
@ -20,7 +20,6 @@ jobs:
    strategy:
      matrix:
        python-version:
          - "3.10"
          - "3.11"
          - "3.12"
--- a/api/Dockerfile
+++ b/api/Dockerfile
@ -55,7 +55,7 @@ RUN apt-get update \
    && echo "deb http://deb.debian.org/debian testing main" > /etc/apt/sources.list \
    && apt-get update \
    # For Security
-    && apt-get install -y --no-install-recommends expat=2.6.3-2 libldap-2.5-0=2.5.18+dfsg-3+b1 perl=5.40.0-7 libsqlite3-0=3.46.1-1 zlib1g=1:1.3.dfsg+really1.3.1-1+b1 \
+    && apt-get install -y --no-install-recommends expat=2.6.4-1 libldap-2.5-0=2.5.18+dfsg-3+b1 perl=5.40.0-7 libsqlite3-0=3.46.1-1 zlib1g=1:1.3.dfsg+really1.3.1-1+b1 \
    # install a chinese font to support the use of tools like matplotlib
    && apt-get install -y fonts-noto-cjk \
    && apt-get autoremove -y \
--- a/api/README.md
+++ b/api/README.md
@ -2,9 +2,6 @@
 ## Usage
 > [!IMPORTANT]
 > In the v0.6.12 release, we deprecated `pip` as the package management tool for Dify API Backend service and replaced it with `poetry`.
 1. Start the docker-compose stack
   The backend require some middleware, including PostgreSQL, Redis, and Weaviate, which can be started together using `docker-compose`.
@ -30,26 +27,24 @@
   SECRET_KEY=${secret_key}" .env
   ```
-4. Create environment.
+4. Prepare Python environment
-   Dify API service uses [Poetry](https://python-poetry.org/docs/) to manage dependencies. You can execute `poetry shell` to activate the environment.
+   Dify API services requires Python 3.11 or 3.12, and the [Poetry](https://python-poetry.org/docs/) for dependency management.
    - To install Poetry, please refer to
      the [Poetry's installation guide](https://python-poetry.org/docs/#installation). The simplest way is to run the `pip install poetry` command to install Poetry on pip.
    - Run `poetry env use 3.12` to switch to the Python version for Poetry, please refer the usage of `poetry env use`
      command in [Poetry docs](https://python-poetry.org/docs/managing-environments/#switching-between-environments).
    - Run `poetry shell` to activate the shell environment with Poetry support.
 5. Install dependencies
   ```bash
-   poetry env use 3.10
+   cd api
   poetry env use 3.12
   poetry install
   ```
-   In case of contributors missing to update dependencies for `pyproject.toml`, you can perform the following shell instead.
+6. Run db migration
   ```bash
   poetry shell                                               # activate current environment
   poetry add $(cat requirements.txt)           # install dependencies of production and update pyproject.toml
   poetry add $(cat requirements-dev.txt) --group dev    # install dependencies of development and update pyproject.toml
   ```
 6. Run migrate
   Before the first launch, migrate the database to the latest version.
@ -57,15 +52,18 @@
   poetry run python -m flask db upgrade
   ```
-7. Start backend
+7. Start api service
   ```bash
-   poetry run python -m flask run --host 0.0.0.0 --port=5001 --debug
+   poetry run python -m flask run --host 0.0.0.0 --port=5001
   ```
 8. Start Dify [web](../web) service.
 9. Setup your application by visiting `http://localhost:3000`...
-10. If you need to handle and debug the async tasks (e.g. dataset importing and documents indexing), please start the worker service.
+
 10. Start the worker service, if you need to handle and debug the async tasks (e.g. dataset importing and documents
    indexing), please start the worker service.
   ```bash
   poetry run python -m celery -A app.celery worker -P gevent -c 1 --loglevel INFO -Q dataset,generation,mail,ops_trace,app_deletion
--- a/api/app.py
+++ b/api/app.py
@ -1,6 +1,11 @@
 import os
 import sys
 python_version = sys.version_info
 if not ((3, 11) <= python_version < (3, 13)):
    print(f"Python 3.11 or 3.12 is required, current version is {python_version.major}.{python_version.minor}")
    raise SystemExit(1)
 from configs import dify_config
 if not dify_config.DEBUG:
@ -30,9 +35,6 @@ from models import account, dataset, model, source, task, tool, tools, web  # no
 # DO NOT REMOVE ABOVE
 if sys.version_info[:2] == (3, 10):
    print("Warning: Python 3.10 will not be supported in the next version.")
 warnings.simplefilter("ignore", ResourceWarning)
--- a/api/controllers/console/app/conversation.py
+++ b/api/controllers/console/app/conversation.py
@ -1,4 +1,4 @@
-from datetime import datetime, timezone
+from datetime import UTC, datetime
 import pytz
 from flask_login import current_user
@ -314,7 +314,7 @@ def _get_conversation(app_model, conversation_id):
        raise NotFound("Conversation Not Exists.")
    if not conversation.read_at:
-        conversation.read_at = datetime.now(timezone.utc).replace(tzinfo=None)
+        conversation.read_at = datetime.now(UTC).replace(tzinfo=None)
        conversation.read_account_id = current_user.id
        db.session.commit()
--- a/api/controllers/console/app/site.py
+++ b/api/controllers/console/app/site.py
@ -1,4 +1,4 @@
-from datetime import datetime, timezone
+from datetime import UTC, datetime
 from flask_login import current_user
 from flask_restful import Resource, marshal_with, reqparse
@ -75,7 +75,7 @@ class AppSite(Resource):
                setattr(site, attr_name, value)
        site.updated_by = current_user.id
-        site.updated_at = datetime.now(timezone.utc).replace(tzinfo=None)
+        site.updated_at = datetime.now(UTC).replace(tzinfo=None)
        db.session.commit()
        return site
@ -99,7 +99,7 @@ class AppSiteAccessTokenReset(Resource):
        site.code = Site.generate_code(16)
        site.updated_by = current_user.id
-        site.updated_at = datetime.now(timezone.utc).replace(tzinfo=None)
+        site.updated_at = datetime.now(UTC).replace(tzinfo=None)
        db.session.commit()
        return site
--- a/api/controllers/console/auth/activate.py
+++ b/api/controllers/console/auth/activate.py
@ -65,7 +65,7 @@ class ActivateApi(Resource):
        account.timezone = args["timezone"]
        account.interface_theme = "light"
        account.status = AccountStatus.ACTIVE.value
-        account.initialized_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+        account.initialized_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
        db.session.commit()
        token_pair = AccountService.login(account, ip_address=extract_remote_ip(request))
--- a/api/controllers/console/auth/oauth.py
+++ b/api/controllers/console/auth/oauth.py
@ -1,5 +1,5 @@
 import logging
-from datetime import datetime, timezone
+from datetime import UTC, datetime
 from typing import Optional
 import requests
@ -106,7 +106,7 @@ class OAuthCallback(Resource):
        if account.status == AccountStatus.PENDING.value:
            account.status = AccountStatus.ACTIVE.value
-            account.initialized_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            account.initialized_at = datetime.now(UTC).replace(tzinfo=None)
            db.session.commit()
        try:
--- a/api/controllers/console/datasets/data_source.py
+++ b/api/controllers/console/datasets/data_source.py
@ -83,7 +83,7 @@ class DataSourceApi(Resource):
        if action == "enable":
            if data_source_binding.disabled:
                data_source_binding.disabled = False
-                data_source_binding.updated_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+                data_source_binding.updated_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
                db.session.add(data_source_binding)
                db.session.commit()
            else:
@ -92,7 +92,7 @@ class DataSourceApi(Resource):
        if action == "disable":
            if not data_source_binding.disabled:
                data_source_binding.disabled = True
-                data_source_binding.updated_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+                data_source_binding.updated_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
                db.session.add(data_source_binding)
                db.session.commit()
            else:
--- a/api/controllers/console/datasets/datasets_document.py
+++ b/api/controllers/console/datasets/datasets_document.py
@ -1,6 +1,6 @@
 import logging
 from argparse import ArgumentTypeError
-from datetime import datetime, timezone
+from datetime import UTC, datetime
 from flask import request
 from flask_login import current_user
@ -665,7 +665,7 @@ class DocumentProcessingApi(DocumentResource):
                raise InvalidActionError("Document not in indexing state.")
            document.paused_by = current_user.id
-            document.paused_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            document.paused_at = datetime.now(UTC).replace(tzinfo=None)
            document.is_paused = True
            db.session.commit()
@ -745,7 +745,7 @@ class DocumentMetadataApi(DocumentResource):
                    document.doc_metadata[key] = value
        document.doc_type = doc_type
-        document.updated_at = datetime.now(timezone.utc).replace(tzinfo=None)
+        document.updated_at = datetime.now(UTC).replace(tzinfo=None)
        db.session.commit()
        return {"result": "success", "message": "Document metadata updated."}, 200
@ -787,7 +787,7 @@ class DocumentStatusApi(DocumentResource):
            document.enabled = True
            document.disabled_at = None
            document.disabled_by = None
-            document.updated_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            document.updated_at = datetime.now(UTC).replace(tzinfo=None)
            db.session.commit()
            # Set cache to prevent indexing the same document multiple times
@ -804,9 +804,9 @@ class DocumentStatusApi(DocumentResource):
                raise InvalidActionError("Document already disabled.")
            document.enabled = False
-            document.disabled_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            document.disabled_at = datetime.now(UTC).replace(tzinfo=None)
            document.disabled_by = current_user.id
-            document.updated_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            document.updated_at = datetime.now(UTC).replace(tzinfo=None)
            db.session.commit()
            # Set cache to prevent indexing the same document multiple times
@ -821,9 +821,9 @@ class DocumentStatusApi(DocumentResource):
                raise InvalidActionError("Document already archived.")
            document.archived = True
-            document.archived_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            document.archived_at = datetime.now(UTC).replace(tzinfo=None)
            document.archived_by = current_user.id
-            document.updated_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            document.updated_at = datetime.now(UTC).replace(tzinfo=None)
            db.session.commit()
            if document.enabled:
@ -840,7 +840,7 @@ class DocumentStatusApi(DocumentResource):
            document.archived = False
            document.archived_at = None
            document.archived_by = None
-            document.updated_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            document.updated_at = datetime.now(UTC).replace(tzinfo=None)
            db.session.commit()
            # Set cache to prevent indexing the same document multiple times
--- a/api/controllers/console/datasets/datasets_segments.py
+++ b/api/controllers/console/datasets/datasets_segments.py
@ -1,5 +1,5 @@
 import uuid
-from datetime import datetime, timezone
+from datetime import UTC, datetime
 import pandas as pd
 from flask import request
@ -188,7 +188,7 @@ class DatasetDocumentSegmentApi(Resource):
                raise InvalidActionError("Segment is already disabled.")
            segment.enabled = False
-            segment.disabled_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            segment.disabled_at = datetime.now(UTC).replace(tzinfo=None)
            segment.disabled_by = current_user.id
            db.session.commit()
--- a/api/controllers/console/explore/completion.py
+++ b/api/controllers/console/explore/completion.py
@ -1,5 +1,5 @@
 import logging
-from datetime import datetime, timezone
+from datetime import UTC, datetime
 from flask_login import current_user
 from flask_restful import reqparse
@ -46,7 +46,7 @@ class CompletionApi(InstalledAppResource):
        streaming = args["response_mode"] == "streaming"
        args["auto_generate_name"] = False
-        installed_app.last_used_at = datetime.now(timezone.utc).replace(tzinfo=None)
+        installed_app.last_used_at = datetime.now(UTC).replace(tzinfo=None)
        db.session.commit()
        try:
@ -106,7 +106,7 @@ class ChatApi(InstalledAppResource):
        args["auto_generate_name"] = False
-        installed_app.last_used_at = datetime.now(timezone.utc).replace(tzinfo=None)
+        installed_app.last_used_at = datetime.now(UTC).replace(tzinfo=None)
        db.session.commit()
        try:
--- a/api/controllers/console/explore/installed_app.py
+++ b/api/controllers/console/explore/installed_app.py
@ -1,4 +1,4 @@
-from datetime import datetime, timezone
+from datetime import UTC, datetime
 from flask_login import current_user
 from flask_restful import Resource, inputs, marshal_with, reqparse
@ -81,7 +81,7 @@ class InstalledAppsListApi(Resource):
                tenant_id=current_tenant_id,
                app_owner_tenant_id=app.tenant_id,
                is_pinned=False,
-                last_used_at=datetime.now(timezone.utc).replace(tzinfo=None),
+                last_used_at=datetime.now(UTC).replace(tzinfo=None),
            )
            db.session.add(new_installed_app)
            db.session.commit()
--- a/api/controllers/console/workspace/account.py
+++ b/api/controllers/console/workspace/account.py
@ -60,7 +60,7 @@ class AccountInitApi(Resource):
                raise InvalidInvitationCodeError()
            invitation_code.status = "used"
-            invitation_code.used_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+            invitation_code.used_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
            invitation_code.used_by_tenant_id = account.current_tenant_id
            invitation_code.used_by_account_id = account.id
@ -68,7 +68,7 @@ class AccountInitApi(Resource):
        account.timezone = args["timezone"]
        account.interface_theme = "light"
        account.status = "active"
-        account.initialized_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+        account.initialized_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
        db.session.commit()
        return {"result": "success"}
--- a/api/controllers/service_api/wraps.py
+++ b/api/controllers/service_api/wraps.py
@ -1,5 +1,5 @@
 from collections.abc import Callable
-from datetime import datetime, timezone
+from datetime import UTC, datetime
 from enum import Enum
 from functools import wraps
 from typing import Optional
@ -198,7 +198,7 @@ def validate_and_get_api_token(scope=None):
    if not api_token:
        raise Unauthorized("Access token is invalid")
-    api_token.last_used_at = datetime.now(timezone.utc).replace(tzinfo=None)
+    api_token.last_used_at = datetime.now(UTC).replace(tzinfo=None)
    db.session.commit()
    return api_token
--- a/api/core/agent/base_agent_runner.py
+++ b/api/core/agent/base_agent_runner.py
@ -2,7 +2,7 @@ import json
 import logging
 import uuid
 from collections.abc import Mapping, Sequence
-from datetime import datetime, timezone
+from datetime import UTC, datetime
 from typing import Optional, Union, cast
 from core.agent.entities import AgentEntity, AgentToolEntity
@ -419,7 +419,7 @@ class BaseAgentRunner(AppRunner):
            .first()
        )
-        db_variables.updated_at = datetime.now(timezone.utc).replace(tzinfo=None)
+        db_variables.updated_at = datetime.now(UTC).replace(tzinfo=None)
        db_variables.variables_str = json.dumps(jsonable_encoder(tool_variables.pool))
        db.session.commit()
        db.session.close()
--- a/api/core/app/app_config/entities.py
+++ b/api/core/app/app_config/entities.py
@ -1,5 +1,5 @@
 from collections.abc import Sequence
-from enum import Enum
+from enum import Enum, StrEnum
 from typing import Any, Optional
 from pydantic import BaseModel, Field, field_validator
@ -88,7 +88,7 @@ class PromptTemplateEntity(BaseModel):
    advanced_completion_prompt_template: Optional[AdvancedCompletionPromptTemplateEntity] = None
-class VariableEntityType(str, Enum):
+class VariableEntityType(StrEnum):
    TEXT_INPUT = "text-input"
    SELECT = "select"
    PARAGRAPH = "paragraph"
--- a/api/core/app/apps/message_based_app_generator.py
+++ b/api/core/app/apps/message_based_app_generator.py
@ -1,7 +1,7 @@
 import json
 import logging
 from collections.abc import Generator
-from datetime import datetime, timezone
+from datetime import UTC, datetime
 from typing import Optional, Union
 from sqlalchemy import and_
@ -200,7 +200,7 @@ class MessageBasedAppGenerator(BaseAppGenerator):
            db.session.commit()
            db.session.refresh(conversation)
        else:
-            conversation.updated_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            conversation.updated_at = datetime.now(UTC).replace(tzinfo=None)
            db.session.commit()
        message = Message(
--- a/api/core/app/entities/queue_entities.py
+++ b/api/core/app/entities/queue_entities.py
@ -1,5 +1,5 @@
 from datetime import datetime
-from enum import Enum
+from enum import Enum, StrEnum
 from typing import Any, Optional
 from pydantic import BaseModel, field_validator
@ -11,7 +11,7 @@ from core.workflow.nodes import NodeType
 from core.workflow.nodes.base import BaseNodeData
-class QueueEvent(str, Enum):
+class QueueEvent(StrEnum):
    """
    QueueEvent enum
    """
--- a/api/core/app/task_pipeline/workflow_cycle_manage.py
+++ b/api/core/app/task_pipeline/workflow_cycle_manage.py
@ -1,7 +1,7 @@
 import json
 import time
 from collections.abc import Mapping, Sequence
-from datetime import datetime, timezone
+from datetime import UTC, datetime
 from typing import Any, Optional, Union, cast
 from sqlalchemy.orm import Session
@ -144,7 +144,7 @@ class WorkflowCycleManage:
        workflow_run.elapsed_time = time.perf_counter() - start_at
        workflow_run.total_tokens = total_tokens
        workflow_run.total_steps = total_steps
-        workflow_run.finished_at = datetime.now(timezone.utc).replace(tzinfo=None)
+        workflow_run.finished_at = datetime.now(UTC).replace(tzinfo=None)
        db.session.commit()
        db.session.refresh(workflow_run)
@ -191,7 +191,7 @@ class WorkflowCycleManage:
        workflow_run.elapsed_time = time.perf_counter() - start_at
        workflow_run.total_tokens = total_tokens
        workflow_run.total_steps = total_steps
-        workflow_run.finished_at = datetime.now(timezone.utc).replace(tzinfo=None)
+        workflow_run.finished_at = datetime.now(UTC).replace(tzinfo=None)
        db.session.commit()
@ -211,7 +211,7 @@ class WorkflowCycleManage:
        for workflow_node_execution in running_workflow_node_executions:
            workflow_node_execution.status = WorkflowNodeExecutionStatus.FAILED.value
            workflow_node_execution.error = error
-            workflow_node_execution.finished_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            workflow_node_execution.finished_at = datetime.now(UTC).replace(tzinfo=None)
            workflow_node_execution.elapsed_time = (
                workflow_node_execution.finished_at - workflow_node_execution.created_at
            ).total_seconds()
@ -259,7 +259,7 @@ class WorkflowCycleManage:
                    NodeRunMetadataKey.ITERATION_ID: event.in_iteration_id,
                }
            )
-            workflow_node_execution.created_at = datetime.now(timezone.utc).replace(tzinfo=None)
+            workflow_node_execution.created_at = datetime.now(UTC).replace(tzinfo=None)
            session.add(workflow_node_execution)
            session.commit()
@ -282,7 +282,7 @@ class WorkflowCycleManage:
        execution_metadata = (
            json.dumps(jsonable_encoder(event.execution_metadata)) if event.execution_metadata else None
        )
-        finished_at = datetime.now(timezone.utc).replace(tzinfo=None)
+        finished_at = datetime.now(UTC).replace(tzinfo=None)
        elapsed_time = (finished_at - event.start_at).total_seconds()
        db.session.query(WorkflowNodeExecution).filter(WorkflowNodeExecution.id == workflow_node_execution.id).update(
@ -326,7 +326,7 @@ class WorkflowCycleManage:
        inputs = WorkflowEntry.handle_special_values(event.inputs)
        process_data = WorkflowEntry.handle_special_values(event.process_data)
        outputs = WorkflowEntry.handle_special_values(event.outputs)
-        finished_at = datetime.now(timezone.utc).replace(tzinfo=None)
+        finished_at = datetime.now(UTC).replace(tzinfo=None)
        elapsed_time = (finished_at - event.start_at).total_seconds()
        execution_metadata = (
            json.dumps(jsonable_encoder(event.execution_metadata)) if event.execution_metadata else None
@ -654,7 +654,7 @@ class WorkflowCycleManage:
                if event.error is None
                else WorkflowNodeExecutionStatus.FAILED,
                error=None,
-                elapsed_time=(datetime.now(timezone.utc).replace(tzinfo=None) - event.start_at).total_seconds(),
+                elapsed_time=(datetime.now(UTC).replace(tzinfo=None) - event.start_at).total_seconds(),
                total_tokens=event.metadata.get("total_tokens", 0) if event.metadata else 0,
                execution_metadata=event.metadata,
                finished_at=int(time.time()),
--- a/api/core/entities/provider_configuration.py
+++ b/api/core/entities/provider_configuration.py
@ -240,7 +240,7 @@ class ProviderConfiguration(BaseModel):
        if provider_record:
            provider_record.encrypted_config = json.dumps(credentials)
            provider_record.is_valid = True
-            provider_record.updated_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+            provider_record.updated_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
            db.session.commit()
        else:
            provider_record = Provider(
@ -394,7 +394,7 @@ class ProviderConfiguration(BaseModel):
        if provider_model_record:
            provider_model_record.encrypted_config = json.dumps(credentials)
            provider_model_record.is_valid = True
-            provider_model_record.updated_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+            provider_model_record.updated_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
            db.session.commit()
        else:
            provider_model_record = ProviderModel(
@ -468,7 +468,7 @@ class ProviderConfiguration(BaseModel):
        if model_setting:
            model_setting.enabled = True
-            model_setting.updated_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+            model_setting.updated_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
            db.session.commit()
        else:
            model_setting = ProviderModelSetting(
@ -503,7 +503,7 @@ class ProviderConfiguration(BaseModel):
        if model_setting:
            model_setting.enabled = False
-            model_setting.updated_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+            model_setting.updated_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
            db.session.commit()
        else:
            model_setting = ProviderModelSetting(
@ -570,7 +570,7 @@ class ProviderConfiguration(BaseModel):
        if model_setting:
            model_setting.load_balancing_enabled = True
-            model_setting.updated_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+            model_setting.updated_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
            db.session.commit()
        else:
            model_setting = ProviderModelSetting(
@ -605,7 +605,7 @@ class ProviderConfiguration(BaseModel):
        if model_setting:
            model_setting.load_balancing_enabled = False
-            model_setting.updated_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+            model_setting.updated_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
            db.session.commit()
        else:
            model_setting = ProviderModelSetting(
--- a/api/core/file/enums.py
+++ b/api/core/file/enums.py
@ -1,7 +1,7 @@
-from enum import Enum
+from enum import StrEnum
-class FileType(str, Enum):
+class FileType(StrEnum):
    IMAGE = "image"
    DOCUMENT = "document"
    AUDIO = "audio"
@ -16,7 +16,7 @@ class FileType(str, Enum):
        raise ValueError(f"No matching enum found for value '{value}'")
-class FileTransferMethod(str, Enum):
+class FileTransferMethod(StrEnum):
    REMOTE_URL = "remote_url"
    LOCAL_FILE = "local_file"
    TOOL_FILE = "tool_file"
@ -29,7 +29,7 @@ class FileTransferMethod(str, Enum):
        raise ValueError(f"No matching enum found for value '{value}'")
-class FileBelongsTo(str, Enum):
+class FileBelongsTo(StrEnum):
    USER = "user"
    ASSISTANT = "assistant"
@ -41,7 +41,7 @@ class FileBelongsTo(str, Enum):
        raise ValueError(f"No matching enum found for value '{value}'")
-class FileAttribute(str, Enum):
+class FileAttribute(StrEnum):
    TYPE = "type"
    SIZE = "size"
    NAME = "name"
@ -51,5 +51,5 @@ class FileAttribute(str, Enum):
    EXTENSION = "extension"
-class ArrayFileAttribute(str, Enum):
+class ArrayFileAttribute(StrEnum):
    LENGTH = "length"
--- a/api/core/helper/code_executor/code_executor.py
+++ b/api/core/helper/code_executor/code_executor.py
@ -1,6 +1,6 @@
 import logging
 from collections.abc import Mapping
-from enum import Enum
+from enum import StrEnum
 from threading import Lock
 from typing import Any, Optional
@ -31,7 +31,7 @@ class CodeExecutionResponse(BaseModel):
    data: Data
-class CodeLanguage(str, Enum):
+class CodeLanguage(StrEnum):
    PYTHON3 = "python3"
    JINJA2 = "jinja2"
    JAVASCRIPT = "javascript"
--- a/api/core/indexing_runner.py
+++ b/api/core/indexing_runner.py
@ -84,7 +84,7 @@ class IndexingRunner:
            except ProviderTokenNotInitError as e:
                dataset_document.indexing_status = "error"
                dataset_document.error = str(e.description)
-                dataset_document.stopped_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+                dataset_document.stopped_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
                db.session.commit()
            except ObjectDeletedError:
                logging.warning("Document deleted, document id: {}".format(dataset_document.id))
@ -92,7 +92,7 @@ class IndexingRunner:
                logging.exception("consume document failed")
                dataset_document.indexing_status = "error"
                dataset_document.error = str(e)
-                dataset_document.stopped_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+                dataset_document.stopped_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
                db.session.commit()
    def run_in_splitting_status(self, dataset_document: DatasetDocument):
@ -140,13 +140,13 @@ class IndexingRunner:
        except ProviderTokenNotInitError as e:
            dataset_document.indexing_status = "error"
            dataset_document.error = str(e.description)
-            dataset_document.stopped_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+            dataset_document.stopped_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
            db.session.commit()
        except Exception as e:
            logging.exception("consume document failed")
            dataset_document.indexing_status = "error"
            dataset_document.error = str(e)
-            dataset_document.stopped_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+            dataset_document.stopped_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
            db.session.commit()
    def run_in_indexing_status(self, dataset_document: DatasetDocument):
@ -198,13 +198,13 @@ class IndexingRunner:
        except ProviderTokenNotInitError as e:
            dataset_document.indexing_status = "error"
            dataset_document.error = str(e.description)
-            dataset_document.stopped_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+            dataset_document.stopped_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
            db.session.commit()
        except Exception as e:
            logging.exception("consume document failed")
            dataset_document.indexing_status = "error"
            dataset_document.error = str(e)
-            dataset_document.stopped_at = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+            dataset_document.stopped_at = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
            db.session.commit()
    def indexing_estimate(
@ -357,7 +357,7 @@ class IndexingRunner:
            after_indexing_status="splitting",
            extra_update_params={
                DatasetDocument.word_count: sum(len(text_doc.page_content) for text_doc in text_docs),
-                DatasetDocument.parsing_completed_at: datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None),
+                DatasetDocument.parsing_completed_at: datetime.datetime.now(datetime.UTC).replace(tzinfo=None),
            },
        )
@ -449,7 +449,7 @@ class IndexingRunner:
        doc_store.add_documents(documents)
        # update document status to indexing
-        cur_time = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+        cur_time = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
        self._update_document_index_status(
            document_id=dataset_document.id,
            after_indexing_status="indexing",
@ -464,7 +464,7 @@ class IndexingRunner:
            dataset_document_id=dataset_document.id,
            update_params={
                DocumentSegment.status: "indexing",
-                DocumentSegment.indexing_at: datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None),
+                DocumentSegment.indexing_at: datetime.datetime.now(datetime.UTC).replace(tzinfo=None),
            },
        )
@ -669,7 +669,7 @@ class IndexingRunner:
            after_indexing_status="completed",
            extra_update_params={
                DatasetDocument.tokens: tokens,
-                DatasetDocument.completed_at: datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None),
+                DatasetDocument.completed_at: datetime.datetime.now(datetime.UTC).replace(tzinfo=None),
                DatasetDocument.indexing_latency: indexing_end_at - indexing_start_at,
                DatasetDocument.error: None,
            },
@ -694,7 +694,7 @@ class IndexingRunner:
                    {
                        DocumentSegment.status: "completed",
                        DocumentSegment.enabled: True,
-                        DocumentSegment.completed_at: datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None),
+                        DocumentSegment.completed_at: datetime.datetime.now(datetime.UTC).replace(tzinfo=None),
                    }
                )
@ -727,7 +727,7 @@ class IndexingRunner:
                {
                    DocumentSegment.status: "completed",
                    DocumentSegment.enabled: True,
-                    DocumentSegment.completed_at: datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None),
+                    DocumentSegment.completed_at: datetime.datetime.now(datetime.UTC).replace(tzinfo=None),
                }
            )
@ -838,7 +838,7 @@ class IndexingRunner:
        doc_store.add_documents(documents)
        # update document status to indexing
-        cur_time = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)
+        cur_time = datetime.datetime.now(datetime.UTC).replace(tzinfo=None)
        self._update_document_index_status(
            document_id=dataset_document.id,
            after_indexing_status="indexing",
@ -853,7 +853,7 @@ class IndexingRunner:
            dataset_document_id=dataset_document.id,
            update_params={
                DocumentSegment.status: "indexing",
-                DocumentSegment.indexing_at: datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None),
+                DocumentSegment.indexing_at: datetime.datetime.now(datetime.UTC).replace(tzinfo=None),
            },
        )
        pass
--- a/api/core/model_runtime/entities/message_entities.py
+++ b/api/core/model_runtime/entities/message_entities.py
@ -1,5 +1,5 @@
 from abc import ABC
-from enum import Enum
+from enum import Enum, StrEnum
 from typing import Optional
 from pydantic import BaseModel, Field, field_validator
@ -93,7 +93,7 @@ class ImagePromptMessageContent(PromptMessageContent):
    Model class for image prompt message content.
    """
-    class DETAIL(str, Enum):
+    class DETAIL(StrEnum):
        LOW = "low"
        HIGH = "high"
--- a/api/core/model_runtime/entities/model_entities.py
+++ b/api/core/model_runtime/entities/model_entities.py
@ -1,5 +1,5 @@
 from decimal import Decimal
-from enum import Enum
+from enum import Enum, StrEnum
 from typing import Any, Optional
 from pydantic import BaseModel, ConfigDict
@ -89,7 +89,7 @@ class ModelFeature(Enum):
    STREAM_TOOL_CALL = "stream-tool-call"
-class DefaultParameterName(str, Enum):
+class DefaultParameterName(StrEnum):
    """
    Enum class for parameter template variable.
    """
--- a/api/core/model_runtime/model_providers/azure_openai/llm/llm.py
+++ b/api/core/model_runtime/model_providers/azure_openai/llm/llm.py
@ -113,7 +113,7 @@ class AzureOpenAILargeLanguageModel(_CommonAzureOpenAI, LargeLanguageModel):
        try:
            client = AzureOpenAI(**self._to_credential_kwargs(credentials))
-            if model.startswith("o1"):
+            if "o1" in model:
                client.chat.completions.create(
                    messages=[{"role": "user", "content": "ping"}],
                    model=model,
@ -311,7 +311,7 @@ class AzureOpenAILargeLanguageModel(_CommonAzureOpenAI, LargeLanguageModel):
        prompt_messages = self._clear_illegal_prompt_messages(model, prompt_messages)
        block_as_stream = False
-        if model.startswith("o1"):
+        if "o1" in model:
            if stream:
                block_as_stream = True
                stream = False
@ -404,7 +404,7 @@ class AzureOpenAILargeLanguageModel(_CommonAzureOpenAI, LargeLanguageModel):
                                ]
                            )
-        if model.startswith("o1"):
+        if "o1" in model:
            system_message_count = len([m for m in prompt_messages if isinstance(m, SystemPromptMessage)])
            if system_message_count > 0:
                new_prompt_messages = []
@ -653,7 +653,7 @@ class AzureOpenAILargeLanguageModel(_CommonAzureOpenAI, LargeLanguageModel):
            tokens_per_message = 4
            # if there's a name, the role is omitted
            tokens_per_name = -1
-        elif model.startswith("gpt-35-turbo") or model.startswith("gpt-4") or model.startswith("o1"):
+        elif model.startswith("gpt-35-turbo") or model.startswith("gpt-4") or "o1" in model:
            tokens_per_message = 3
            tokens_per_name = 1
        else:
--- a/api/core/model_runtime/model_providers/gitee_ai/llm/Qwen2.5-72B-Instruct.yaml
+++ b/api/core/model_runtime/model_providers/gitee_ai/llm/Qwen2.5-72B-Instruct.yaml
@ -0,0 +1,95 @@
 model: Qwen2.5-72B-Instruct
 label:
  zh_Hans: Qwen2.5-72B-Instruct
  en_US: Qwen2.5-72B-Instruct
 model_type: llm
 features:
  - agent-thought
  - tool-call
  - stream-tool-call
 model_properties:
  mode: chat
  context_size: 32768
 parameter_rules:
  - name: max_tokens
    use_template: max_tokens
    label:
      en_US: "Max Tokens"
      zh_Hans: "最大Token数"
    type: int
    default: 512
    min: 1
    required: true
    help:
      en_US: "The maximum number of tokens that can be generated by the model varies depending on the model."
      zh_Hans: "模型可生成的最大 token 个数，不同模型上限不同。"
  - name: temperature
    use_template: temperature
    label:
      en_US: "Temperature"
      zh_Hans: "采样温度"
    type: float
    default: 0.7
    min: 0.0
    max: 1.0
    precision: 1
    required: true
    help:
      en_US: "The randomness of the sampling temperature control output. The temperature value is within the range of [0.0, 1.0]. The higher the value, the more random and creative the output; the lower the value, the more stable it is. It is recommended to adjust either top_p or temperature parameters according to your needs to avoid adjusting both at the same time."
      zh_Hans: "采样温度控制输出的随机性。温度值在 [0.0, 1.0] 范围内，值越高，输出越随机和创造性；值越低，输出越稳定。建议根据需求调整 top_p 或 temperature 参数，避免同时调整两者。"
  - name: top_p
    use_template: top_p
    label:
      en_US: "Top P"
      zh_Hans: "Top P"
    type: float
    default: 0.7
    min: 0.0
    max: 1.0
    precision: 1
    required: true
    help:
      en_US: "The value range of the sampling method is [0.0, 1.0]. The top_p value determines that the model selects tokens from the top p% of candidate words with the highest probability; when top_p is 0, this parameter is invalid. It is recommended to adjust either top_p or temperature parameters according to your needs to avoid adjusting both at the same time."
      zh_Hans: "采样方法的取值范围为 [0.0,1.0]。top_p 值确定模型从概率最高的前p%的候选词中选取 tokens；当 top_p 为 0 时，此参数无效。建议根据需求调整 top_p 或 temperature 参数，避免同时调整两者。"
  - name: top_k
    use_template: top_k
    label:
      en_US: "Top K"
      zh_Hans: "Top K"
    type: int
    default: 50
    min: 0
    max: 100
    required: true
    help:
      en_US: "The value range is [0,100], which limits the model to only select from the top k words with the highest probability when choosing the next word at each step. The larger the value, the more diverse text generation will be."
      zh_Hans: "取值范围为 [0,100]，限制模型在每一步选择下一个词时，只从概率最高的前 k 个词中选取。数值越大，文本生成越多样。"
  - name: frequency_penalty
    use_template: frequency_penalty
    label:
      en_US: "Frequency Penalty"
      zh_Hans: "频率惩罚"
    type: float
    default: 0
    min: -1.0
    max: 1.0
    precision: 1
    required: false
    help:
      en_US: "Used to adjust the frequency of repeated content in automatically generated text. Positive numbers reduce repetition, while negative numbers increase repetition. After setting this parameter, if a word has already appeared in the text, the model will decrease the probability of choosing that word for subsequent generation."
      zh_Hans: "用于调整自动生成文本中重复内容的频率。正数减少重复，负数增加重复。设置此参数后，如果一个词在文本中已经出现过，模型在后续生成中选择该词的概率会降低。"
  - name: user
    use_template: text
    label:
      en_US: "User"
      zh_Hans: "用户"
    type: string
    required: false
    help:
      en_US: "Used to track and differentiate conversation requests from different users."
      zh_Hans: "用于追踪和区分不同用户的对话请求。"
--- a/api/core/model_runtime/model_providers/gitee_ai/llm/_position.yaml
+++ b/api/core/model_runtime/model_providers/gitee_ai/llm/_position.yaml
@ -1,3 +1,4 @@
 - Qwen2.5-72B-Instruct
 - Qwen2-7B-Instruct
 - Qwen2-72B-Instruct
 - Yi-1.5-34B-Chat
--- a/api/core/model_runtime/model_providers/gitee_ai/llm/llm.py
+++ b/api/core/model_runtime/model_providers/gitee_ai/llm/llm.py
@ -6,6 +6,7 @@ from core.model_runtime.entities.message_entities import (
    PromptMessage,
    PromptMessageTool,
 )
 from core.model_runtime.entities.model_entities import ModelFeature
 from core.model_runtime.model_providers.openai_api_compatible.llm.llm import OAIAPICompatLargeLanguageModel
@ -28,14 +29,13 @@ class GiteeAILargeLanguageModel(OAIAPICompatLargeLanguageModel):
        user: Optional[str] = None,
    ) -> Union[LLMResult, Generator]:
        self._add_custom_parameters(credentials, model, model_parameters)
-        return super()._invoke(model, credentials, prompt_messages, model_parameters, tools, stop, stream)
+        return super()._invoke(model, credentials, prompt_messages, model_parameters, tools, stop, stream, user)
    def validate_credentials(self, model: str, credentials: dict) -> None:
        self._add_custom_parameters(credentials, model, None)
        super().validate_credentials(model, credentials)
-    @staticmethod
+    def _add_custom_parameters(self, credentials: dict, model: str, model_parameters: dict) -> None:
    def _add_custom_parameters(credentials: dict, model: str, model_parameters: dict) -> None:
        if model is None:
            model = "bge-large-zh-v1.5"
@ -45,3 +45,7 @@ class GiteeAILargeLanguageModel(OAIAPICompatLargeLanguageModel):
            credentials["mode"] = LLMMode.COMPLETION.value
        else:
            credentials["mode"] = LLMMode.CHAT.value
        schema = self.get_model_schema(model, credentials)
        if ModelFeature.TOOL_CALL in schema.features or ModelFeature.MULTI_TOOL_CALL in schema.features:
            credentials["function_calling_type"] = "tool_call"
--- a/api/core/model_runtime/model_providers/minimax/llm/abab7-chat-preview.yaml
+++ b/api/core/model_runtime/model_providers/minimax/llm/abab7-chat-preview.yaml
@ -0,0 +1,46 @@
 model: abab7-chat-preview
 label:
  en_US: Abab7-chat-preview
 model_type: llm
 features:
  - agent-thought
  - tool-call
  - stream-tool-call
 model_properties:
  mode: chat
  context_size: 245760
 parameter_rules:
  - name: temperature
    use_template: temperature
    min: 0.01
    max: 1
    default: 0.1
  - name: top_p
    use_template: top_p
    min: 0.01
    max: 1
    default: 0.95
  - name: max_tokens
    use_template: max_tokens
    required: true
    default: 2048
    min: 1
    max: 245760
  - name: mask_sensitive_info
    type: boolean
    default: true
    label:
      zh_Hans: 隐私保护
      en_US: Moderate
    help:
      zh_Hans: 对输出中易涉及隐私问题的文本信息进行打码，目前包括但不限于邮箱、域名、链接、证件号、家庭住址等，默认true，即开启打码
      en_US: Mask the sensitive info of the generated content, such as email/domain/link/address/phone/id..
  - name: presence_penalty
    use_template: presence_penalty
  - name: frequency_penalty
    use_template: frequency_penalty
 pricing:
  input: '0.1'
  output: '0.1'
  unit: '0.001'
  currency: RMB
--- a/api/core/model_runtime/model_providers/minimax/llm/llm.py
+++ b/api/core/model_runtime/model_providers/minimax/llm/llm.py
@ -34,6 +34,7 @@ from core.model_runtime.model_providers.minimax.llm.types import MinimaxMessage
 class MinimaxLargeLanguageModel(LargeLanguageModel):
    model_apis = {
        "abab7-chat-preview": MinimaxChatCompletionPro,
        "abab6.5s-chat": MinimaxChatCompletionPro,
        "abab6.5-chat": MinimaxChatCompletionPro,
        "abab6-chat": MinimaxChatCompletionPro,
--- a/api/core/model_runtime/model_providers/siliconflow/llm/Internvl2-26b.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/Internvl2-26b.yaml
@ -0,0 +1,84 @@
 model: OpenGVLab/InternVL2-26B
 label:
  en_US: OpenGVLab/InternVL2-26B
 model_type: llm
 features:
  - vision
 model_properties:
  mode: chat
  context_size: 32768
 parameter_rules:
  - name: temperature
    use_template: temperature
    type: float
    default: 0.3
    min: 0.0
    max: 2.0
    help:
      zh_Hans: 用于控制随机性和多样性的程度。具体来说，temperature值控制了生成文本时对每个候选词的概率分布进行平滑的程度。较高的temperature值会降低概率分布的峰值，使得更多的低概率词被选择，生成结果更加多样化；而较低的temperature值则会增强概率分布的峰值，使得高概率词更容易被选择，生成结果更加确定。
      en_US: Used to control the degree of randomness and diversity. Specifically, the temperature value controls the degree to which the probability distribution of each candidate word is smoothed when generating text. A higher temperature value will reduce the peak value of the probability distribution, allowing more low-probability words to be selected, and the generated results will be more diverse; while a lower temperature value will enhance the peak value of the probability distribution, making it easier for high-probability words to be selected. , the generated results are more certain.
  - name: max_tokens
    use_template: max_tokens
    type: int
    default: 2000
    min: 1
    max: 2000
    help:
      zh_Hans: 用于指定模型在生成内容时token的最大数量，它定义了生成的上限，但不保证每次都会生成到这个数量。
      en_US: It is used to specify the maximum number of tokens when the model generates content. It defines the upper limit of generation, but does not guarantee that this number will be generated every time.
  - name: top_p
    use_template: top_p
    type: float
    default: 0.8
    min: 0.1
    max: 0.9
    help:
      zh_Hans: 生成过程中核采样方法概率阈值，例如，取值为0.8时，仅保留概率加起来大于等于0.8的最可能token的最小集合作为候选集。取值范围为（0,1.0)，取值越大，生成的随机性越高；取值越低，生成的确定性越高。
      en_US: The probability threshold of the kernel sampling method during the generation process. For example, when the value is 0.8, only the smallest set of the most likely tokens with a sum of probabilities greater than or equal to 0.8 is retained as the candidate set. The value range is (0,1.0). The larger the value, the higher the randomness generated; the lower the value, the higher the certainty generated.
  - name: top_k
    type: int
    min: 0
    max: 99
    label:
      zh_Hans: 取样数量
      en_US: Top k
    help:
      zh_Hans: 生成时，采样候选集的大小。例如，取值为50时，仅将单次生成中得分最高的50个token组成随机采样的候选集。取值越大，生成的随机性越高；取值越小，生成的确定性越高。
      en_US: The size of the sample candidate set when generated. For example, when the value is 50, only the 50 highest-scoring tokens in a single generation form a randomly sampled candidate set. The larger the value, the higher the randomness generated; the smaller the value, the higher the certainty generated.
  - name: seed
    required: false
    type: int
    default: 1234
    label:
      zh_Hans: 随机种子
      en_US: Random seed
    help:
      zh_Hans: 生成时使用的随机数种子，用户控制模型生成内容的随机性。支持无符号64位整数，默认值为 1234。在使用seed时，模型将尽可能生成相同或相似的结果，但目前不保证每次生成的结果完全相同。
      en_US: The random number seed used when generating, the user controls the randomness of the content generated by the model. Supports unsigned 64-bit integers, default value is 1234. When using seed, the model will try its best to generate the same or similar results, but there is currently no guarantee that the results will be exactly the same every time.
  - name: repetition_penalty
    required: false
    type: float
    default: 1.1
    label:
      zh_Hans: 重复惩罚
      en_US: Repetition penalty
    help:
      zh_Hans: 用于控制模型生成时的重复度。提高repetition_penalty时可以降低模型生成的重复度。1.0表示不做惩罚。
      en_US: Used to control the repeatability when generating models. Increasing repetition_penalty can reduce the duplication of model generation. 1.0 means no punishment.
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '21'
  output: '21'
  unit: '0.000001'
  currency: RMB
--- a/api/core/model_runtime/model_providers/siliconflow/llm/Internvl2-8b.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/Internvl2-8b.yaml
@ -0,0 +1,84 @@
 model: Pro/OpenGVLab/InternVL2-8B
 label:
  en_US: Pro/OpenGVLab/InternVL2-8B
 model_type: llm
 features:
  - vision
 model_properties:
  mode: chat
  context_size: 32768
 parameter_rules:
  - name: temperature
    use_template: temperature
    type: float
    default: 0.3
    min: 0.0
    max: 2.0
    help:
      zh_Hans: 用于控制随机性和多样性的程度。具体来说，temperature值控制了生成文本时对每个候选词的概率分布进行平滑的程度。较高的temperature值会降低概率分布的峰值，使得更多的低概率词被选择，生成结果更加多样化；而较低的temperature值则会增强概率分布的峰值，使得高概率词更容易被选择，生成结果更加确定。
      en_US: Used to control the degree of randomness and diversity. Specifically, the temperature value controls the degree to which the probability distribution of each candidate word is smoothed when generating text. A higher temperature value will reduce the peak value of the probability distribution, allowing more low-probability words to be selected, and the generated results will be more diverse; while a lower temperature value will enhance the peak value of the probability distribution, making it easier for high-probability words to be selected. , the generated results are more certain.
  - name: max_tokens
    use_template: max_tokens
    type: int
    default: 2000
    min: 1
    max: 2000
    help:
      zh_Hans: 用于指定模型在生成内容时token的最大数量，它定义了生成的上限，但不保证每次都会生成到这个数量。
      en_US: It is used to specify the maximum number of tokens when the model generates content. It defines the upper limit of generation, but does not guarantee that this number will be generated every time.
  - name: top_p
    use_template: top_p
    type: float
    default: 0.8
    min: 0.1
    max: 0.9
    help:
      zh_Hans: 生成过程中核采样方法概率阈值，例如，取值为0.8时，仅保留概率加起来大于等于0.8的最可能token的最小集合作为候选集。取值范围为（0,1.0)，取值越大，生成的随机性越高；取值越低，生成的确定性越高。
      en_US: The probability threshold of the kernel sampling method during the generation process. For example, when the value is 0.8, only the smallest set of the most likely tokens with a sum of probabilities greater than or equal to 0.8 is retained as the candidate set. The value range is (0,1.0). The larger the value, the higher the randomness generated; the lower the value, the higher the certainty generated.
  - name: top_k
    type: int
    min: 0
    max: 99
    label:
      zh_Hans: 取样数量
      en_US: Top k
    help:
      zh_Hans: 生成时，采样候选集的大小。例如，取值为50时，仅将单次生成中得分最高的50个token组成随机采样的候选集。取值越大，生成的随机性越高；取值越小，生成的确定性越高。
      en_US: The size of the sample candidate set when generated. For example, when the value is 50, only the 50 highest-scoring tokens in a single generation form a randomly sampled candidate set. The larger the value, the higher the randomness generated; the smaller the value, the higher the certainty generated.
  - name: seed
    required: false
    type: int
    default: 1234
    label:
      zh_Hans: 随机种子
      en_US: Random seed
    help:
      zh_Hans: 生成时使用的随机数种子，用户控制模型生成内容的随机性。支持无符号64位整数，默认值为 1234。在使用seed时，模型将尽可能生成相同或相似的结果，但目前不保证每次生成的结果完全相同。
      en_US: The random number seed used when generating, the user controls the randomness of the content generated by the model. Supports unsigned 64-bit integers, default value is 1234. When using seed, the model will try its best to generate the same or similar results, but there is currently no guarantee that the results will be exactly the same every time.
  - name: repetition_penalty
    required: false
    type: float
    default: 1.1
    label:
      zh_Hans: 重复惩罚
      en_US: Repetition penalty
    help:
      zh_Hans: 用于控制模型生成时的重复度。提高repetition_penalty时可以降低模型生成的重复度。1.0表示不做惩罚。
      en_US: Used to control the repeatability when generating models. Increasing repetition_penalty can reduce the duplication of model generation. 1.0 means no punishment.
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '21'
  output: '21'
  unit: '0.000001'
  currency: RMB
--- a/api/core/model_runtime/model_providers/siliconflow/llm/_position.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/_position.yaml
@ -1,16 +1,18 @@
 - Tencent/Hunyuan-A52B-Instruct
 - Qwen/Qwen2.5-72B-Instruct
 - Qwen/Qwen2.5-32B-Instruct
 - Qwen/Qwen2.5-14B-Instruct
 - Qwen/Qwen2.5-7B-Instruct
 - Qwen/Qwen2.5-Coder-32B-Instruct
 - Qwen/Qwen2.5-Coder-7B-Instruct
 - Qwen/Qwen2.5-Math-72B-Instruct
- Qwen/Qwen2-72B-Instruct
+- Qwen/Qwen2-VL-72B-Instruct
 - Qwen/Qwen2-57B-A14B-Instruct
 - Qwen/Qwen2-7B-Instruct
 - Qwen/Qwen2-1.5B-Instruct
 - Pro/Qwen/Qwen2-VL-7B-Instruct
 - OpenGVLab/InternVL2-Llama3-76B
 - OpenGVLab/InternVL2-26B
 - Pro/OpenGVLab/InternVL2-8B
 - deepseek-ai/DeepSeek-V2.5
 - deepseek-ai/DeepSeek-V2-Chat
 - deepseek-ai/DeepSeek-Coder-V2-Instruct
 - THUDM/glm-4-9b-chat
 - 01-ai/Yi-1.5-34B-Chat-16K
 - 01-ai/Yi-1.5-9B-Chat-16K
@ -20,9 +22,6 @@
 - meta-llama/Meta-Llama-3.1-405B-Instruct
 - meta-llama/Meta-Llama-3.1-70B-Instruct
 - meta-llama/Meta-Llama-3.1-8B-Instruct
 - meta-llama/Meta-Llama-3-70B-Instruct
 - meta-llama/Meta-Llama-3-8B-Instruct
 - google/gemma-2-27b-it
 - google/gemma-2-9b-it
- mistralai/Mistral-7B-Instruct-v0.2
+- deepseek-ai/DeepSeek-V2-Chat
 - mistralai/Mixtral-8x7B-Instruct-v0.1
--- a/api/core/model_runtime/model_providers/siliconflow/llm/deepdeek-coder-v2-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/deepdeek-coder-v2-instruct.yaml
@ -37,3 +37,4 @@ pricing:
  output: '1.33'
  unit: '0.000001'
  currency: RMB
 deprecated: true
--- a/api/core/model_runtime/model_providers/siliconflow/llm/deepseek-v2-chat.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/deepseek-v2-chat.yaml
@ -37,3 +37,4 @@ pricing:
  output: '1.33'
  unit: '0.000001'
  currency: RMB
 deprecated: true
--- a/api/core/model_runtime/model_providers/siliconflow/llm/deepseek-v2.5.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/deepseek-v2.5.yaml
@ -4,6 +4,8 @@ label:
 model_type: llm
 features:
  - agent-thought
  - tool-call
  - stream-tool-call
 model_properties:
  mode: chat
  context_size: 32768
@ -32,6 +34,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '1.33'
  output: '1.33'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/gemma-2-27b-it.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/gemma-2-27b-it.yaml
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '1.26'
  output: '1.26'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/gemma-2-9b-it.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/gemma-2-9b-it.yaml
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '0'
  output: '0'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/glm4-9b-chat.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/glm4-9b-chat.yaml
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '0'
  output: '0'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/hunyuan-a52b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/hunyuan-a52b-instruct.yaml
@ -0,0 +1,84 @@
 model: Tencent/Hunyuan-A52B-Instruct
 label:
  en_US: Tencent/Hunyuan-A52B-Instruct
 model_type: llm
 features:
  - agent-thought
 model_properties:
  mode: chat
  context_size: 32768
 parameter_rules:
  - name: temperature
    use_template: temperature
    type: float
    default: 0.3
    min: 0.0
    max: 2.0
    help:
      zh_Hans: 用于控制随机性和多样性的程度。具体来说，temperature值控制了生成文本时对每个候选词的概率分布进行平滑的程度。较高的temperature值会降低概率分布的峰值，使得更多的低概率词被选择，生成结果更加多样化；而较低的temperature值则会增强概率分布的峰值，使得高概率词更容易被选择，生成结果更加确定。
      en_US: Used to control the degree of randomness and diversity. Specifically, the temperature value controls the degree to which the probability distribution of each candidate word is smoothed when generating text. A higher temperature value will reduce the peak value of the probability distribution, allowing more low-probability words to be selected, and the generated results will be more diverse; while a lower temperature value will enhance the peak value of the probability distribution, making it easier for high-probability words to be selected. , the generated results are more certain.
  - name: max_tokens
    use_template: max_tokens
    type: int
    default: 2000
    min: 1
    max: 2000
    help:
      zh_Hans: 用于指定模型在生成内容时token的最大数量，它定义了生成的上限，但不保证每次都会生成到这个数量。
      en_US: It is used to specify the maximum number of tokens when the model generates content. It defines the upper limit of generation, but does not guarantee that this number will be generated every time.
  - name: top_p
    use_template: top_p
    type: float
    default: 0.8
    min: 0.1
    max: 0.9
    help:
      zh_Hans: 生成过程中核采样方法概率阈值，例如，取值为0.8时，仅保留概率加起来大于等于0.8的最可能token的最小集合作为候选集。取值范围为（0,1.0)，取值越大，生成的随机性越高；取值越低，生成的确定性越高。
      en_US: The probability threshold of the kernel sampling method during the generation process. For example, when the value is 0.8, only the smallest set of the most likely tokens with a sum of probabilities greater than or equal to 0.8 is retained as the candidate set. The value range is (0,1.0). The larger the value, the higher the randomness generated; the lower the value, the higher the certainty generated.
  - name: top_k
    type: int
    min: 0
    max: 99
    label:
      zh_Hans: 取样数量
      en_US: Top k
    help:
      zh_Hans: 生成时，采样候选集的大小。例如，取值为50时，仅将单次生成中得分最高的50个token组成随机采样的候选集。取值越大，生成的随机性越高；取值越小，生成的确定性越高。
      en_US: The size of the sample candidate set when generated. For example, when the value is 50, only the 50 highest-scoring tokens in a single generation form a randomly sampled candidate set. The larger the value, the higher the randomness generated; the smaller the value, the higher the certainty generated.
  - name: seed
    required: false
    type: int
    default: 1234
    label:
      zh_Hans: 随机种子
      en_US: Random seed
    help:
      zh_Hans: 生成时使用的随机数种子，用户控制模型生成内容的随机性。支持无符号64位整数，默认值为 1234。在使用seed时，模型将尽可能生成相同或相似的结果，但目前不保证每次生成的结果完全相同。
      en_US: The random number seed used when generating, the user controls the randomness of the content generated by the model. Supports unsigned 64-bit integers, default value is 1234. When using seed, the model will try its best to generate the same or similar results, but there is currently no guarantee that the results will be exactly the same every time.
  - name: repetition_penalty
    required: false
    type: float
    default: 1.1
    label:
      zh_Hans: 重复惩罚
      en_US: Repetition penalty
    help:
      zh_Hans: 用于控制模型生成时的重复度。提高repetition_penalty时可以降低模型生成的重复度。1.0表示不做惩罚。
      en_US: Used to control the repeatability when generating models. Increasing repetition_penalty can reduce the duplication of model generation. 1.0 means no punishment.
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '21'
  output: '21'
  unit: '0.000001'
  currency: RMB
--- a/api/core/model_runtime/model_providers/siliconflow/llm/internlm2_5-20b-chat.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/internlm2_5-20b-chat.yaml
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '1'
  output: '1'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/internlm2_5-7b-chat.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/internlm2_5-7b-chat.yaml
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '0'
  output: '0'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/internvl2-llama3-76b.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/internvl2-llama3-76b.yaml
@ -0,0 +1,84 @@
 model: OpenGVLab/InternVL2-Llama3-76B
 label:
  en_US: OpenGVLab/InternVL2-Llama3-76B
 model_type: llm
 features:
  - vision
 model_properties:
  mode: chat
  context_size: 8192
 parameter_rules:
  - name: temperature
    use_template: temperature
    type: float
    default: 0.3
    min: 0.0
    max: 2.0
    help:
      zh_Hans: 用于控制随机性和多样性的程度。具体来说，temperature值控制了生成文本时对每个候选词的概率分布进行平滑的程度。较高的temperature值会降低概率分布的峰值，使得更多的低概率词被选择，生成结果更加多样化；而较低的temperature值则会增强概率分布的峰值，使得高概率词更容易被选择，生成结果更加确定。
      en_US: Used to control the degree of randomness and diversity. Specifically, the temperature value controls the degree to which the probability distribution of each candidate word is smoothed when generating text. A higher temperature value will reduce the peak value of the probability distribution, allowing more low-probability words to be selected, and the generated results will be more diverse; while a lower temperature value will enhance the peak value of the probability distribution, making it easier for high-probability words to be selected. , the generated results are more certain.
  - name: max_tokens
    use_template: max_tokens
    type: int
    default: 2000
    min: 1
    max: 2000
    help:
      zh_Hans: 用于指定模型在生成内容时token的最大数量，它定义了生成的上限，但不保证每次都会生成到这个数量。
      en_US: It is used to specify the maximum number of tokens when the model generates content. It defines the upper limit of generation, but does not guarantee that this number will be generated every time.
  - name: top_p
    use_template: top_p
    type: float
    default: 0.8
    min: 0.1
    max: 0.9
    help:
      zh_Hans: 生成过程中核采样方法概率阈值，例如，取值为0.8时，仅保留概率加起来大于等于0.8的最可能token的最小集合作为候选集。取值范围为（0,1.0)，取值越大，生成的随机性越高；取值越低，生成的确定性越高。
      en_US: The probability threshold of the kernel sampling method during the generation process. For example, when the value is 0.8, only the smallest set of the most likely tokens with a sum of probabilities greater than or equal to 0.8 is retained as the candidate set. The value range is (0,1.0). The larger the value, the higher the randomness generated; the lower the value, the higher the certainty generated.
  - name: top_k
    type: int
    min: 0
    max: 99
    label:
      zh_Hans: 取样数量
      en_US: Top k
    help:
      zh_Hans: 生成时，采样候选集的大小。例如，取值为50时，仅将单次生成中得分最高的50个token组成随机采样的候选集。取值越大，生成的随机性越高；取值越小，生成的确定性越高。
      en_US: The size of the sample candidate set when generated. For example, when the value is 50, only the 50 highest-scoring tokens in a single generation form a randomly sampled candidate set. The larger the value, the higher the randomness generated; the smaller the value, the higher the certainty generated.
  - name: seed
    required: false
    type: int
    default: 1234
    label:
      zh_Hans: 随机种子
      en_US: Random seed
    help:
      zh_Hans: 生成时使用的随机数种子，用户控制模型生成内容的随机性。支持无符号64位整数，默认值为 1234。在使用seed时，模型将尽可能生成相同或相似的结果，但目前不保证每次生成的结果完全相同。
      en_US: The random number seed used when generating, the user controls the randomness of the content generated by the model. Supports unsigned 64-bit integers, default value is 1234. When using seed, the model will try its best to generate the same or similar results, but there is currently no guarantee that the results will be exactly the same every time.
  - name: repetition_penalty
    required: false
    type: float
    default: 1.1
    label:
      zh_Hans: 重复惩罚
      en_US: Repetition penalty
    help:
      zh_Hans: 用于控制模型生成时的重复度。提高repetition_penalty时可以降低模型生成的重复度。1.0表示不做惩罚。
      en_US: Used to control the repeatability when generating models. Increasing repetition_penalty can reduce the duplication of model generation. 1.0 means no punishment.
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '21'
  output: '21'
  unit: '0.000001'
  currency: RMB
--- a/api/core/model_runtime/model_providers/siliconflow/llm/llm.py
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/llm.py
@ -29,6 +29,9 @@ class SiliconflowLargeLanguageModel(OAIAPICompatLargeLanguageModel):
        user: Optional[str] = None,
    ) -> Union[LLMResult, Generator]:
        self._add_custom_parameters(credentials)
        # {"response_format": "json_object"} need convert to {"response_format": {"type": "json_object"}}
        if "response_format" in model_parameters:
            model_parameters["response_format"] = {"type": model_parameters.get("response_format")}
        return super()._invoke(model, credentials, prompt_messages, model_parameters, tools, stop, stream)
    def validate_credentials(self, model: str, credentials: dict) -> None:
--- a/api/core/model_runtime/model_providers/siliconflow/llm/meta-mlama-3-70b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/meta-mlama-3-70b-instruct.yaml
@ -37,3 +37,4 @@ pricing:
  output: '4.13'
  unit: '0.000001'
  currency: RMB
 deprecated: true
--- a/api/core/model_runtime/model_providers/siliconflow/llm/meta-mlama-3-8b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/meta-mlama-3-8b-instruct.yaml
@ -37,3 +37,4 @@ pricing:
  output: '0'
  unit: '0.000001'
  currency: RMB
 deprecated: true
--- a/api/core/model_runtime/model_providers/siliconflow/llm/meta-mlama-3.1-405b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/meta-mlama-3.1-405b-instruct.yaml
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '21'
  output: '21'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/meta-mlama-3.1-70b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/meta-mlama-3.1-70b-instruct.yaml
@ -6,7 +6,7 @@ features:
  - agent-thought
 model_properties:
  mode: chat
-  context_size: 32768
+  context_size: 8192
 parameter_rules:
  - name: temperature
    use_template: temperature
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '4.13'
  output: '4.13'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/meta-mlama-3.1-8b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/meta-mlama-3.1-8b-instruct.yaml
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '0'
  output: '0'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2-57b-a14b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2-57b-a14b-instruct.yaml
@ -37,3 +37,4 @@ pricing:
  output: '1.26'
  unit: '0.000001'
  currency: RMB
 deprecated: true
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2-72b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2-72b-instruct.yaml
@ -37,3 +37,4 @@ pricing:
  output: '4.13'
  unit: '0.000001'
  currency: RMB
 deprecated: true
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2-7b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2-7b-instruct.yaml
@ -37,3 +37,4 @@ pricing:
  output: '0'
  unit: '0.000001'
  currency: RMB
 deprecated: true
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2-vl-72b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2-vl-72b-instruct.yaml
@ -0,0 +1,84 @@
 model: Qwen/Qwen2-VL-72B-Instruct
 label:
  en_US: Qwen/Qwen2-VL-72B-Instruct
 model_type: llm
 features:
  - vision
 model_properties:
  mode: chat
  context_size: 32768
 parameter_rules:
  - name: temperature
    use_template: temperature
    type: float
    default: 0.3
    min: 0.0
    max: 2.0
    help:
      zh_Hans: 用于控制随机性和多样性的程度。具体来说，temperature值控制了生成文本时对每个候选词的概率分布进行平滑的程度。较高的temperature值会降低概率分布的峰值，使得更多的低概率词被选择，生成结果更加多样化；而较低的temperature值则会增强概率分布的峰值，使得高概率词更容易被选择，生成结果更加确定。
      en_US: Used to control the degree of randomness and diversity. Specifically, the temperature value controls the degree to which the probability distribution of each candidate word is smoothed when generating text. A higher temperature value will reduce the peak value of the probability distribution, allowing more low-probability words to be selected, and the generated results will be more diverse; while a lower temperature value will enhance the peak value of the probability distribution, making it easier for high-probability words to be selected. , the generated results are more certain.
  - name: max_tokens
    use_template: max_tokens
    type: int
    default: 2000
    min: 1
    max: 2000
    help:
      zh_Hans: 用于指定模型在生成内容时token的最大数量，它定义了生成的上限，但不保证每次都会生成到这个数量。
      en_US: It is used to specify the maximum number of tokens when the model generates content. It defines the upper limit of generation, but does not guarantee that this number will be generated every time.
  - name: top_p
    use_template: top_p
    type: float
    default: 0.8
    min: 0.1
    max: 0.9
    help:
      zh_Hans: 生成过程中核采样方法概率阈值，例如，取值为0.8时，仅保留概率加起来大于等于0.8的最可能token的最小集合作为候选集。取值范围为（0,1.0)，取值越大，生成的随机性越高；取值越低，生成的确定性越高。
      en_US: The probability threshold of the kernel sampling method during the generation process. For example, when the value is 0.8, only the smallest set of the most likely tokens with a sum of probabilities greater than or equal to 0.8 is retained as the candidate set. The value range is (0,1.0). The larger the value, the higher the randomness generated; the lower the value, the higher the certainty generated.
  - name: top_k
    type: int
    min: 0
    max: 99
    label:
      zh_Hans: 取样数量
      en_US: Top k
    help:
      zh_Hans: 生成时，采样候选集的大小。例如，取值为50时，仅将单次生成中得分最高的50个token组成随机采样的候选集。取值越大，生成的随机性越高；取值越小，生成的确定性越高。
      en_US: The size of the sample candidate set when generated. For example, when the value is 50, only the 50 highest-scoring tokens in a single generation form a randomly sampled candidate set. The larger the value, the higher the randomness generated; the smaller the value, the higher the certainty generated.
  - name: seed
    required: false
    type: int
    default: 1234
    label:
      zh_Hans: 随机种子
      en_US: Random seed
    help:
      zh_Hans: 生成时使用的随机数种子，用户控制模型生成内容的随机性。支持无符号64位整数，默认值为 1234。在使用seed时，模型将尽可能生成相同或相似的结果，但目前不保证每次生成的结果完全相同。
      en_US: The random number seed used when generating, the user controls the randomness of the content generated by the model. Supports unsigned 64-bit integers, default value is 1234. When using seed, the model will try its best to generate the same or similar results, but there is currently no guarantee that the results will be exactly the same every time.
  - name: repetition_penalty
    required: false
    type: float
    default: 1.1
    label:
      zh_Hans: 重复惩罚
      en_US: Repetition penalty
    help:
      zh_Hans: 用于控制模型生成时的重复度。提高repetition_penalty时可以降低模型生成的重复度。1.0表示不做惩罚。
      en_US: Used to control the repeatability when generating models. Increasing repetition_penalty can reduce the duplication of model generation. 1.0 means no punishment.
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '21'
  output: '21'
  unit: '0.000001'
  currency: RMB
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2-vl-7b-Instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2-vl-7b-Instruct.yaml
@ -0,0 +1,84 @@
 model: Pro/Qwen/Qwen2-VL-7B-Instruct
 label:
  en_US: Pro/Qwen/Qwen2-VL-7B-Instruct
 model_type: llm
 features:
  - vision
 model_properties:
  mode: chat
  context_size: 32768
 parameter_rules:
  - name: temperature
    use_template: temperature
    type: float
    default: 0.3
    min: 0.0
    max: 2.0
    help:
      zh_Hans: 用于控制随机性和多样性的程度。具体来说，temperature值控制了生成文本时对每个候选词的概率分布进行平滑的程度。较高的temperature值会降低概率分布的峰值，使得更多的低概率词被选择，生成结果更加多样化；而较低的temperature值则会增强概率分布的峰值，使得高概率词更容易被选择，生成结果更加确定。
      en_US: Used to control the degree of randomness and diversity. Specifically, the temperature value controls the degree to which the probability distribution of each candidate word is smoothed when generating text. A higher temperature value will reduce the peak value of the probability distribution, allowing more low-probability words to be selected, and the generated results will be more diverse; while a lower temperature value will enhance the peak value of the probability distribution, making it easier for high-probability words to be selected. , the generated results are more certain.
  - name: max_tokens
    use_template: max_tokens
    type: int
    default: 2000
    min: 1
    max: 2000
    help:
      zh_Hans: 用于指定模型在生成内容时token的最大数量，它定义了生成的上限，但不保证每次都会生成到这个数量。
      en_US: It is used to specify the maximum number of tokens when the model generates content. It defines the upper limit of generation, but does not guarantee that this number will be generated every time.
  - name: top_p
    use_template: top_p
    type: float
    default: 0.8
    min: 0.1
    max: 0.9
    help:
      zh_Hans: 生成过程中核采样方法概率阈值，例如，取值为0.8时，仅保留概率加起来大于等于0.8的最可能token的最小集合作为候选集。取值范围为（0,1.0)，取值越大，生成的随机性越高；取值越低，生成的确定性越高。
      en_US: The probability threshold of the kernel sampling method during the generation process. For example, when the value is 0.8, only the smallest set of the most likely tokens with a sum of probabilities greater than or equal to 0.8 is retained as the candidate set. The value range is (0,1.0). The larger the value, the higher the randomness generated; the lower the value, the higher the certainty generated.
  - name: top_k
    type: int
    min: 0
    max: 99
    label:
      zh_Hans: 取样数量
      en_US: Top k
    help:
      zh_Hans: 生成时，采样候选集的大小。例如，取值为50时，仅将单次生成中得分最高的50个token组成随机采样的候选集。取值越大，生成的随机性越高；取值越小，生成的确定性越高。
      en_US: The size of the sample candidate set when generated. For example, when the value is 50, only the 50 highest-scoring tokens in a single generation form a randomly sampled candidate set. The larger the value, the higher the randomness generated; the smaller the value, the higher the certainty generated.
  - name: seed
    required: false
    type: int
    default: 1234
    label:
      zh_Hans: 随机种子
      en_US: Random seed
    help:
      zh_Hans: 生成时使用的随机数种子，用户控制模型生成内容的随机性。支持无符号64位整数，默认值为 1234。在使用seed时，模型将尽可能生成相同或相似的结果，但目前不保证每次生成的结果完全相同。
      en_US: The random number seed used when generating, the user controls the randomness of the content generated by the model. Supports unsigned 64-bit integers, default value is 1234. When using seed, the model will try its best to generate the same or similar results, but there is currently no guarantee that the results will be exactly the same every time.
  - name: repetition_penalty
    required: false
    type: float
    default: 1.1
    label:
      zh_Hans: 重复惩罚
      en_US: Repetition penalty
    help:
      zh_Hans: 用于控制模型生成时的重复度。提高repetition_penalty时可以降低模型生成的重复度。1.0表示不做惩罚。
      en_US: Used to control the repeatability when generating models. Increasing repetition_penalty can reduce the duplication of model generation. 1.0 means no punishment.
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '21'
  output: '21'
  unit: '0.000001'
  currency: RMB
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-14b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-14b-instruct.yaml
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '0.7'
  output: '0.7'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-32b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-32b-instruct.yaml
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '1.26'
  output: '1.26'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-72b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-72b-instruct.yaml
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '4.13'
  output: '4.13'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-7b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-7b-instruct.yaml
@ -32,6 +32,18 @@ parameter_rules:
    required: false
  - name: frequency_penalty
    use_template: frequency_penalty
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '0'
  output: '0'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-coder-32b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-coder-32b-instruct.yaml
@ -0,0 +1,84 @@
 model: Qwen/Qwen2.5-Coder-32B-Instruct
 label:
  en_US: Qwen/Qwen2.5-Coder-32B-Instruct
 model_type: llm
 features:
  - agent-thought
 model_properties:
  mode: chat
  context_size: 32768
 parameter_rules:
  - name: temperature
    use_template: temperature
    type: float
    default: 0.3
    min: 0.0
    max: 2.0
    help:
      zh_Hans: 用于控制随机性和多样性的程度。具体来说，temperature值控制了生成文本时对每个候选词的概率分布进行平滑的程度。较高的temperature值会降低概率分布的峰值，使得更多的低概率词被选择，生成结果更加多样化；而较低的temperature值则会增强概率分布的峰值，使得高概率词更容易被选择，生成结果更加确定。
      en_US: Used to control the degree of randomness and diversity. Specifically, the temperature value controls the degree to which the probability distribution of each candidate word is smoothed when generating text. A higher temperature value will reduce the peak value of the probability distribution, allowing more low-probability words to be selected, and the generated results will be more diverse; while a lower temperature value will enhance the peak value of the probability distribution, making it easier for high-probability words to be selected. , the generated results are more certain.
  - name: max_tokens
    use_template: max_tokens
    type: int
    default: 8192
    min: 1
    max: 8192
    help:
      zh_Hans: 用于指定模型在生成内容时token的最大数量，它定义了生成的上限，但不保证每次都会生成到这个数量。
      en_US: It is used to specify the maximum number of tokens when the model generates content. It defines the upper limit of generation, but does not guarantee that this number will be generated every time.
  - name: top_p
    use_template: top_p
    type: float
    default: 0.8
    min: 0.1
    max: 0.9
    help:
      zh_Hans: 生成过程中核采样方法概率阈值，例如，取值为0.8时，仅保留概率加起来大于等于0.8的最可能token的最小集合作为候选集。取值范围为（0,1.0)，取值越大，生成的随机性越高；取值越低，生成的确定性越高。
      en_US: The probability threshold of the kernel sampling method during the generation process. For example, when the value is 0.8, only the smallest set of the most likely tokens with a sum of probabilities greater than or equal to 0.8 is retained as the candidate set. The value range is (0,1.0). The larger the value, the higher the randomness generated; the lower the value, the higher the certainty generated.
  - name: top_k
    type: int
    min: 0
    max: 99
    label:
      zh_Hans: 取样数量
      en_US: Top k
    help:
      zh_Hans: 生成时，采样候选集的大小。例如，取值为50时，仅将单次生成中得分最高的50个token组成随机采样的候选集。取值越大，生成的随机性越高；取值越小，生成的确定性越高。
      en_US: The size of the sample candidate set when generated. For example, when the value is 50, only the 50 highest-scoring tokens in a single generation form a randomly sampled candidate set. The larger the value, the higher the randomness generated; the smaller the value, the higher the certainty generated.
  - name: seed
    required: false
    type: int
    default: 1234
    label:
      zh_Hans: 随机种子
      en_US: Random seed
    help:
      zh_Hans: 生成时使用的随机数种子，用户控制模型生成内容的随机性。支持无符号64位整数，默认值为 1234。在使用seed时，模型将尽可能生成相同或相似的结果，但目前不保证每次生成的结果完全相同。
      en_US: The random number seed used when generating, the user controls the randomness of the content generated by the model. Supports unsigned 64-bit integers, default value is 1234. When using seed, the model will try its best to generate the same or similar results, but there is currently no guarantee that the results will be exactly the same every time.
  - name: repetition_penalty
    required: false
    type: float
    default: 1.1
    label:
      zh_Hans: 重复惩罚
      en_US: Repetition penalty
    help:
      zh_Hans: 用于控制模型生成时的重复度。提高repetition_penalty时可以降低模型生成的重复度。1.0表示不做惩罚。
      en_US: Used to control the repeatability when generating models. Increasing repetition_penalty can reduce the duplication of model generation. 1.0 means no punishment.
  - name: response_format
    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '1.26'
  output: '1.26'
  unit: '0.000001'
  currency: RMB
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-coder-7b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-coder-7b-instruct.yaml
@ -66,7 +66,17 @@ parameter_rules:
      zh_Hans: 用于控制模型生成时的重复度。提高repetition_penalty时可以降低模型生成的重复度。1.0表示不做惩罚。
      en_US: Used to control the repeatability when generating models. Increasing repetition_penalty can reduce the duplication of model generation. 1.0 means no punishment.
  - name: response_format
-    use_template: response_format
+    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '0'
  output: '0'
--- a/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-math-72b-instruct.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/llm/qwen2.5-math-72b-instruct.yaml
@ -66,7 +66,17 @@ parameter_rules:
      zh_Hans: 用于控制模型生成时的重复度。提高repetition_penalty时可以降低模型生成的重复度。1.0表示不做惩罚。
      en_US: Used to control the repeatability when generating models. Increasing repetition_penalty can reduce the duplication of model generation. 1.0 means no punishment.
  - name: response_format
-    use_template: response_format
+    label:
      zh_Hans: 回复格式
      en_US: Response Format
    type: string
    help:
      zh_Hans: 指定模型必须输出的格式
      en_US: specifying the format that the model must output
    required: false
    options:
      - text
      - json_object
 pricing:
  input: '4.13'
  output: '4.13'
--- a/api/core/model_runtime/model_providers/siliconflow/speech2text/funaudio-sense-voice-small.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/speech2text/funaudio-sense-voice-small.yaml
@ -0,0 +1,5 @@
 model: FunAudioLLM/SenseVoiceSmall
 model_type: speech2text
 model_properties:
  file_upload_limit: 1
  supported_file_extensions: mp3,wav
--- a/api/core/model_runtime/model_providers/siliconflow/speech2text/sense-voice-small.yaml
+++ b/api/core/model_runtime/model_providers/siliconflow/speech2text/sense-voice-small.yaml
@ -3,3 +3,4 @@ model_type: speech2text
 model_properties:
  file_upload_limit: 1
  supported_file_extensions: mp3,wav
 deprecated: true
--- a/api/core/model_runtime/model_providers/volcengine_maas/legacy/volc_sdk/common.py
+++ b/api/core/model_runtime/model_providers/volcengine_maas/legacy/volc_sdk/common.py
@ -1,5 +1,6 @@
 import json
 import random
 from collections import UserDict
 from datetime import datetime
@ -10,9 +11,9 @@ class ChatRole:
    FUNCTION = "function"
-class _Dict(dict):
+class _Dict(UserDict):
-    __setattr__ = dict.__setitem__
+    __setattr__ = UserDict.__setitem__
-    __getattr__ = dict.__getitem__
+    __getattr__ = UserDict.__getitem__
    def __missing__(self, key):
        return None
--- a/api/core/ops/entities/trace_entity.py
+++ b/api/core/ops/entities/trace_entity.py
@ -1,5 +1,5 @@
 from datetime import datetime
-from enum import Enum
+from enum import StrEnum
 from typing import Any, Optional, Union
 from pydantic import BaseModel, ConfigDict, field_validator
@ -122,7 +122,7 @@ trace_info_info_map = {
 }
-class TraceTaskName(str, Enum):
+class TraceTaskName(StrEnum):
    CONVERSATION_TRACE = "conversation"
    WORKFLOW_TRACE = "workflow"
    MESSAGE_TRACE = "message"
--- a/api/core/ops/langfuse_trace/entities/langfuse_trace_entity.py
+++ b/api/core/ops/langfuse_trace/entities/langfuse_trace_entity.py
@ -1,5 +1,5 @@
 from datetime import datetime
-from enum import Enum
+from enum import StrEnum
 from typing import Any, Optional, Union
 from pydantic import BaseModel, ConfigDict, Field, field_validator
@ -39,7 +39,7 @@ def validate_input_output(v, field_name):
    return v
-class LevelEnum(str, Enum):
+class LevelEnum(StrEnum):
    DEBUG = "DEBUG"
    WARNING = "WARNING"
    ERROR = "ERROR"
@ -178,7 +178,7 @@ class LangfuseSpan(BaseModel):
        return validate_input_output(v, field_name)
-class UnitEnum(str, Enum):
+class UnitEnum(StrEnum):
    CHARACTERS = "CHARACTERS"
    TOKENS = "TOKENS"
    SECONDS = "SECONDS"
--- a/api/core/ops/langsmith_trace/entities/langsmith_trace_entity.py
+++ b/api/core/ops/langsmith_trace/entities/langsmith_trace_entity.py
@ -1,5 +1,5 @@
 from datetime import datetime
-from enum import Enum
+from enum import StrEnum
 from typing import Any, Optional, Union
 from pydantic import BaseModel, Field, field_validator
@ -8,7 +8,7 @@ from pydantic_core.core_schema import ValidationInfo
 from core.ops.utils import replace_text_with_content
-class LangSmithRunType(str, Enum):
+class LangSmithRunType(StrEnum):
    tool = "tool"
    chain = "chain"
    llm = "llm"
--- a/api/core/prompt/simple_prompt_transform.py
+++ b/api/core/prompt/simple_prompt_transform.py
@ -23,7 +23,7 @@ if TYPE_CHECKING:
    from core.file.models import File
-class ModelMode(str, enum.Enum):
+class ModelMode(enum.StrEnum):
    COMPLETION = "completion"
    CHAT = "chat"
--- a/api/core/rag/datasource/keyword/keyword_type.py
+++ b/api/core/rag/datasource/keyword/keyword_type.py
@ -1,5 +1,5 @@
-from enum import Enum
+from enum import StrEnum
-class KeyWordType(str, Enum):
+class KeyWordType(StrEnum):
    JIEBA = "jieba"
--- a/api/core/rag/datasource/vdb/elasticsearch/elasticsearch_vector.py
+++ b/api/core/rag/datasource/vdb/elasticsearch/elasticsearch_vector.py
@ -178,6 +178,7 @@ class ElasticSearchVector(BaseVector):
                        Field.VECTOR.value: {  # Make sure the dimension is correct here
                            "type": "dense_vector",
                            "dims": dim,
                            "index": True,
                            "similarity": "cosine",
                        },
                        Field.METADATA_KEY.value: {
--- a/api/core/rag/datasource/vdb/vector_type.py
+++ b/api/core/rag/datasource/vdb/vector_type.py
@ -1,7 +1,7 @@
-from enum import Enum
+from enum import StrEnum
-class VectorType(str, Enum):
+class VectorType(StrEnum):
    ANALYTICDB = "analyticdb"
    CHROMA = "chroma"
    MILVUS = "milvus"
--- a/api/core/rag/extractor/word_extractor.py
+++ b/api/core/rag/extractor/word_extractor.py
@ -114,10 +114,10 @@ class WordExtractor(BaseExtractor):
                    mime_type=mime_type or "",
                    created_by=self.user_id,
                    created_by_role=CreatedByRole.ACCOUNT,
-                    created_at=datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None),
+                    created_at=datetime.datetime.now(datetime.UTC).replace(tzinfo=None),
                    used=True,
                    used_by=self.user_id,
-                    used_at=datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None),
+                    used_at=datetime.datetime.now(datetime.UTC).replace(tzinfo=None),
                )
                db.session.add(upload_file)
--- a/api/core/rag/rerank/rerank_type.py
+++ b/api/core/rag/rerank/rerank_type.py
@ -1,6 +1,6 @@
-from enum import Enum
+from enum import StrEnum
-class RerankMode(str, Enum):
+class RerankMode(StrEnum):
    RERANKING_MODEL = "reranking_model"
    WEIGHTED_SCORE = "weighted_score"
--- a/api/core/tools/entities/tool_entities.py
+++ b/api/core/tools/entities/tool_entities.py
@ -1,4 +1,4 @@
-from enum import Enum
+from enum import Enum, StrEnum
 from typing import Any, Optional, Union, cast
 from pydantic import BaseModel, Field, field_validator
@ -137,7 +137,7 @@ class ToolParameterOption(BaseModel):
 class ToolParameter(BaseModel):
-    class ToolParameterType(str, Enum):
+    class ToolParameterType(StrEnum):
        STRING = "string"
        NUMBER = "number"
        BOOLEAN = "boolean"
--- a/api/core/tools/provider/_position.yaml
+++ b/api/core/tools/provider/_position.yaml
@ -78,3 +78,4 @@
 - regex
 - trello
 - vanna
 - fal
--- a/api/core/tools/provider/builtin/audio/_assets/icon.svg
+++ b/api/core/tools/provider/builtin/audio/_assets/icon.svg
@ -0,0 +1,3 @@
 <svg xmlns="http://www.w3.org/2000/svg" width="200" height="200" viewBox="0 0 200 200" fill="none">
 <path d="M167.358 102.395C167.358 117.174 157.246 129.18 144.61 131.027H137.861C125.225 129.18 115.113 117.174 115.113 102.395H100.792C100.792 123.637 115.118 142.106 133.653 145.801V164.276H147.139V145.801C165.674 142.106 180 124.558 180 102.4H167.358V102.395ZM154.717 62.677C154.717 53.4397 147.979 46.9765 140.396 46.9765C138.523 46.9446 136.663 47.3273 134.924 48.1024C133.185 48.8775 131.603 50.0294 130.27 51.4909C128.936 52.9524 127.878 54.6943 127.157 56.6148C126.436 58.5354 126.066 60.5962 126.07 62.677V78.3775H154.717V70.4478V62.677ZM126.07 102.395C126.07 111.632 132.813 118.095 140.396 118.095C142.269 118.127 144.13 117.744 145.868 116.969C147.607 116.194 149.189 115.042 150.523 113.581C151.856 112.119 152.914 110.377 153.635 108.457C154.356 106.536 154.726 104.475 154.722 102.395V86.694H126.07V102.395ZM92.1297 45.8938L70.4796 21.7595L69.4235 20.5865L59.604 20L68.3674 20.5865L67.3113 21.7654L64.1429 25.2961L63.6149 25.8826L64.1429 27.0614L66.2552 29.4133L77.8723 42.3631H54.1099C35.1 43.5361 20.3146 61.1896 20.3146 81.7874V83.5527H28.2354V81.7932C28.2354 65.8992 39.8525 52.3628 54.1099 51.1899H77.8723L66.2552 64.1338L64.671 65.8992L64.1429 67.0722L63.6149 67.6645L64.1429 68.251L68.3674 72.9606L68.8954 73.5471L69.4235 72.9606L74.1759 67.6645L92.1297 47.6591L92.6578 47.0727L92.1297 45.8938ZM20 95.8496V118.213H30.033V107.034H50.099V168.821H40.066V180H70.165V168.821H60.132V107.034H80.198V118.213H90.231V95.8496H20Z" fill="#FF0099"/>
 </svg>
--- a/api/core/tools/provider/builtin/audio/audio.py
+++ b/api/core/tools/provider/builtin/audio/audio.py
@ -0,0 +1,6 @@
 from core.tools.provider.builtin_tool_provider import BuiltinToolProviderController
 class AudioToolProvider(BuiltinToolProviderController):
    def _validate_credentials(self, credentials: dict) -> None:
        pass
--- a/api/core/tools/provider/builtin/audio/audio.yaml
+++ b/api/core/tools/provider/builtin/audio/audio.yaml
@ -0,0 +1,11 @@
 identity:
  author: hjlarry
  name: audio
  label:
    en_US: Audio
  description:
    en_US: A tool for tts and asr.
    zh_Hans: 一个用于文本转语音和语音转文本的工具。
  icon: icon.svg
  tags:
    - utilities
--- a/api/core/tools/provider/builtin/audio/tools/asr.py
+++ b/api/core/tools/provider/builtin/audio/tools/asr.py
@ -0,0 +1,69 @@
 import io
 from typing import Any
 from core.file.enums import FileType
 from core.file.file_manager import download
 from core.model_manager import ModelManager
 from core.model_runtime.entities.model_entities import ModelType
 from core.tools.entities.common_entities import I18nObject
 from core.tools.entities.tool_entities import ToolInvokeMessage, ToolParameter, ToolParameterOption
 from core.tools.tool.builtin_tool import BuiltinTool
 from services.model_provider_service import ModelProviderService
 class ASRTool(BuiltinTool):
    def _invoke(self, user_id: str, tool_parameters: dict[str, Any]) -> list[ToolInvokeMessage]:
        file = tool_parameters.get("audio_file")
        if file.type != FileType.AUDIO:
            return [self.create_text_message("not a valid audio file")]
        audio_binary = io.BytesIO(download(file))
        audio_binary.name = "temp.mp3"
        provider, model = tool_parameters.get("model").split("#")
        model_manager = ModelManager()
        model_instance = model_manager.get_model_instance(
            tenant_id=self.runtime.tenant_id,
            provider=provider,
            model_type=ModelType.SPEECH2TEXT,
            model=model,
        )
        text = model_instance.invoke_speech2text(
            file=audio_binary,
            user=user_id,
        )
        return [self.create_text_message(text)]
    def get_available_models(self) -> list[tuple[str, str]]:
        model_provider_service = ModelProviderService()
        models = model_provider_service.get_models_by_model_type(
            tenant_id=self.runtime.tenant_id, model_type="speech2text"
        )
        items = []
        for provider_model in models:
            provider = provider_model.provider
            for model in provider_model.models:
                items.append((provider, model.model))
        return items
    def get_runtime_parameters(self) -> list[ToolParameter]:
        parameters = []
        options = []
        for provider, model in self.get_available_models():
            option = ToolParameterOption(value=f"{provider}#{model}", label=I18nObject(en_US=f"{model}({provider})"))
            options.append(option)
        parameters.append(
            ToolParameter(
                name="model",
                label=I18nObject(en_US="Model", zh_Hans="Model"),
                human_description=I18nObject(
                    en_US="All available ASR models. You can config model in the Model Provider of Settings.",
                    zh_Hans="所有可用的 ASR 模型。你可以在设置中的模型供应商里配置。",
                ),
                type=ToolParameter.ToolParameterType.SELECT,
                form=ToolParameter.ToolParameterForm.FORM,
                required=True,
                options=options,
            )
        )
        return parameters
--- a/api/core/tools/provider/builtin/audio/tools/asr.yaml
+++ b/api/core/tools/provider/builtin/audio/tools/asr.yaml
@ -0,0 +1,22 @@
 identity:
  name: asr
  author: hjlarry
  label:
    en_US: Speech To Text
 description:
  human:
    en_US: Convert audio file to text.
    zh_Hans: 将音频文件转换为文本。
  llm: Convert audio file to text.
 parameters:
  - name: audio_file
    type: file
    required: true
    label:
      en_US: Audio File
      zh_Hans: 音频文件
    human_description:
      en_US: The audio file to be converted.
      zh_Hans: 要转换的音频文件。
    llm_description: The audio file to be converted.
    form: llm
--- a/api/core/tools/provider/builtin/audio/tools/tts.py
+++ b/api/core/tools/provider/builtin/audio/tools/tts.py
@ -0,0 +1,89 @@
 import io
 from typing import Any
 from core.model_manager import ModelManager
 from core.model_runtime.entities.model_entities import ModelPropertyKey, ModelType
 from core.tools.entities.common_entities import I18nObject
 from core.tools.entities.tool_entities import ToolInvokeMessage, ToolParameter, ToolParameterOption
 from core.tools.tool.builtin_tool import BuiltinTool
 from services.model_provider_service import ModelProviderService
 class TTSTool(BuiltinTool):
    def _invoke(self, user_id: str, tool_parameters: dict[str, Any]) -> list[ToolInvokeMessage]:
        provider, model = tool_parameters.get("model").split("#")
        voice = tool_parameters.get(f"voice#{provider}#{model}")
        model_manager = ModelManager()
        model_instance = model_manager.get_model_instance(
            tenant_id=self.runtime.tenant_id,
            provider=provider,
            model_type=ModelType.TTS,
            model=model,
        )
        tts = model_instance.invoke_tts(
            content_text=tool_parameters.get("text"),
            user=user_id,
            tenant_id=self.runtime.tenant_id,
            voice=voice,
        )
        buffer = io.BytesIO()
        for chunk in tts:
            buffer.write(chunk)
        wav_bytes = buffer.getvalue()
        return [
            self.create_text_message("Audio generated successfully"),
            self.create_blob_message(
                blob=wav_bytes,
                meta={"mime_type": "audio/x-wav"},
                save_as=self.VariableKey.AUDIO,
            ),
        ]
    def get_available_models(self) -> list[tuple[str, str, list[Any]]]:
        model_provider_service = ModelProviderService()
        models = model_provider_service.get_models_by_model_type(tenant_id=self.runtime.tenant_id, model_type="tts")
        items = []
        for provider_model in models:
            provider = provider_model.provider
            for model in provider_model.models:
                voices = model.model_properties.get(ModelPropertyKey.VOICES, [])
                items.append((provider, model.model, voices))
        return items
    def get_runtime_parameters(self) -> list[ToolParameter]:
        parameters = []
        options = []
        for provider, model, voices in self.get_available_models():
            option = ToolParameterOption(value=f"{provider}#{model}", label=I18nObject(en_US=f"{model}({provider})"))
            options.append(option)
            parameters.append(
                ToolParameter(
                    name=f"voice#{provider}#{model}",
                    label=I18nObject(en_US=f"Voice of {model}({provider})"),
                    type=ToolParameter.ToolParameterType.SELECT,
                    form=ToolParameter.ToolParameterForm.FORM,
                    options=[
                        ToolParameterOption(value=voice.get("mode"), label=I18nObject(en_US=voice.get("name")))
                        for voice in voices
                    ],
                )
            )
        parameters.insert(
            0,
            ToolParameter(
                name="model",
                label=I18nObject(en_US="Model", zh_Hans="Model"),
                human_description=I18nObject(
                    en_US="All available TTS models. You can config model in the Model Provider of Settings.",
                    zh_Hans="所有可用的 TTS 模型。你可以在设置中的模型供应商里配置。",
                ),
                type=ToolParameter.ToolParameterType.SELECT,
                form=ToolParameter.ToolParameterForm.FORM,
                required=True,
                options=options,
            ),
        )
        return parameters
--- a/api/core/tools/provider/builtin/audio/tools/tts.yaml
+++ b/api/core/tools/provider/builtin/audio/tools/tts.yaml
@ -0,0 +1,22 @@
 identity:
  name: tts
  author: hjlarry
  label:
    en_US: Text To Speech
 description:
  human:
    en_US: Convert text to audio file.
    zh_Hans: 将文本转换为音频文件。
  llm: Convert text to audio file.
 parameters:
  - name: text
    type: string
    required: true
    label:
      en_US: Text
      zh_Hans: 文本
    human_description:
      en_US: The text to be converted.
      zh_Hans: 要转换的文本。
    llm_description: The text to be converted.
    form: llm
--- a/api/core/tools/provider/builtin/email/_assets/icon.svg
+++ b/api/core/tools/provider/builtin/email/_assets/icon.svg
--- a/api/core/tools/provider/builtin/email/email.py
+++ b/api/core/tools/provider/builtin/email/email.py
@ -0,0 +1,7 @@
 from core.tools.provider.builtin.email.tools.send_mail import SendMailTool
 from core.tools.provider.builtin_tool_provider import BuiltinToolProviderController
 class SmtpProvider(BuiltinToolProviderController):
    def _validate_credentials(self, credentials: dict) -> None:
        SendMailTool()
--- a/api/core/tools/provider/builtin/email/email.yaml
+++ b/api/core/tools/provider/builtin/email/email.yaml
@ -0,0 +1,83 @@
 identity:
  author: wakaka6
  name: email
  label:
    en_US: email
    zh_Hans: 电子邮件
  description:
    en_US: send email through smtp protocol
    zh_Hans: 通过smtp协议发送电子邮件
  icon: icon.svg
  tags:
    - utilities
 credentials_for_provider:
  email_account:
    type: text-input
    required: true
    label:
      en_US: email account
      zh_Hans: 邮件账号
    placeholder:
      en_US: input you email account
      zh_Hans: 输入你的邮箱账号
    help:
      en_US: email account
      zh_Hans: 邮件账号
  email_password:
    type: secret-input
    required: true
    label:
      en_US: email password
      zh_Hans: 邮件密码
    placeholder:
      en_US: email password
      zh_Hans: 邮件密码
    help:
      en_US: email password
      zh_Hans: 邮件密码
  smtp_server:
    type: text-input
    required: true
    label:
      en_US: smtp server
      zh_Hans: 发信smtp服务器地址
    placeholder:
      en_US: smtp server
      zh_Hans: 发信smtp服务器地址
    help:
      en_US: smtp server
      zh_Hans: 发信smtp服务器地址
  smtp_port:
    type: text-input
    required: true
    label:
      en_US: smtp server port
      zh_Hans: 发信smtp服务器端口
    placeholder:
      en_US: smtp server port
      zh_Hans: 发信smtp服务器端口
    help:
      en_US: smtp server port
      zh_Hans: 发信smtp服务器端口
  encrypt_method:
    type: select
    required: true
    options:
      - value: NONE
        label:
          en_US: NONE
          zh_Hans: 无加密
      - value: SSL
        label:
          en_US: SSL
          zh_Hans: SSL加密
      - value: TLS
        label:
          en_US: START TLS
          zh_Hans: START TLS加密
    label:
      en_US: encrypt method
      zh_Hans: 加密方式
    help:
      en_US: smtp server encrypt method
      zh_Hans: 发信smtp服务器加密方式
--- a/api/core/tools/provider/builtin/email/tools/send.py
+++ b/api/core/tools/provider/builtin/email/tools/send.py
@ -0,0 +1,53 @@
 import logging
 import smtplib
 import ssl
 from email.mime.multipart import MIMEMultipart
 from email.mime.text import MIMEText
 from pydantic import BaseModel
 class SendEmailToolParameters(BaseModel):
    smtp_server: str
    smtp_port: int
    email_account: str
    email_password: str
    sender_to: str
    subject: str
    email_content: str
    encrypt_method: str
 def send_mail(parmas: SendEmailToolParameters):
    timeout = 60
    msg = MIMEMultipart("alternative")
    msg["From"] = parmas.email_account
    msg["To"] = parmas.sender_to
    msg["Subject"] = parmas.subject
    msg.attach(MIMEText(parmas.email_content, "plain"))
    msg.attach(MIMEText(parmas.email_content, "html"))
    ctx = ssl.create_default_context()
    if parmas.encrypt_method.upper() == "SSL":
        try:
            with smtplib.SMTP_SSL(parmas.smtp_server, parmas.smtp_port, context=ctx, timeout=timeout) as server:
                server.login(parmas.email_account, parmas.email_password)
                server.sendmail(parmas.email_account, parmas.sender_to, msg.as_string())
                return True
        except Exception as e:
            logging.exception("send email failed: %s", e)
            return False
    else:  # NONE or TLS
        try:
            with smtplib.SMTP(parmas.smtp_server, parmas.smtp_port, timeout=timeout) as server:
                if parmas.encrypt_method.upper() == "TLS":
                    server.starttls(context=ctx)
                server.login(parmas.email_account, parmas.email_password)
                server.sendmail(parmas.email_account, parmas.sender_to, msg.as_string())
                return True
        except Exception as e:
            logging.exception("send email failed: %s", e)
            return False
--- a/api/core/tools/provider/builtin/email/tools/send_mail.py
+++ b/api/core/tools/provider/builtin/email/tools/send_mail.py
@ -0,0 +1,66 @@
 import re
 from typing import Any, Union
 from core.tools.entities.tool_entities import ToolInvokeMessage
 from core.tools.provider.builtin.email.tools.send import (
    SendEmailToolParameters,
    send_mail,
 )
 from core.tools.tool.builtin_tool import BuiltinTool
 class SendMailTool(BuiltinTool):
    def _invoke(
        self, user_id: str, tool_parameters: dict[str, Any]
    ) -> Union[ToolInvokeMessage, list[ToolInvokeMessage]]:
        """
        invoke tools
        """
        sender = self.runtime.credentials.get("email_account", "")
        email_rgx = re.compile(r"^[a-zA-Z0-9_-]+@[a-zA-Z0-9_-]+(\.[a-zA-Z0-9_-]+)+$")
        password = self.runtime.credentials.get("email_password", "")
        smtp_server = self.runtime.credentials.get("smtp_server", "")
        if not smtp_server:
            return self.create_text_message("please input smtp server")
        smtp_port = self.runtime.credentials.get("smtp_port", "")
        try:
            smtp_port = int(smtp_port)
        except ValueError:
            return self.create_text_message("Invalid parameter smtp_port(should be int)")
        if not sender:
            return self.create_text_message("please input sender")
        if not email_rgx.match(sender):
            return self.create_text_message("Invalid parameter userid, the sender is not a mailbox")
        receiver_email = tool_parameters["send_to"]
        if not receiver_email:
            return self.create_text_message("please input receiver email")
        if not email_rgx.match(receiver_email):
            return self.create_text_message("Invalid parameter receiver email, the receiver email is not a mailbox")
        email_content = tool_parameters.get("email_content", "")
        if not email_content:
            return self.create_text_message("please input email content")
        subject = tool_parameters.get("subject", "")
        if not subject:
            return self.create_text_message("please input email subject")
        encrypt_method = self.runtime.credentials.get("encrypt_method", "")
        if not encrypt_method:
            return self.create_text_message("please input encrypt method")
        send_email_params = SendEmailToolParameters(
            smtp_server=smtp_server,
            smtp_port=smtp_port,
            email_account=sender,
            email_password=password,
            sender_to=receiver_email,
            subject=subject,
            email_content=email_content,
            encrypt_method=encrypt_method,
        )
        if send_mail(send_email_params):
            return self.create_text_message("send email success")
        return self.create_text_message("send email failed")
--- a/api/core/tools/provider/builtin/email/tools/send_mail.yaml
+++ b/api/core/tools/provider/builtin/email/tools/send_mail.yaml
@ -0,0 +1,46 @@
 identity:
  name: send_mail
  author: wakaka6
  label:
    en_US: send email
    zh_Hans: 发送邮件
  icon: icon.svg
 description:
  human:
    en_US: A tool for sending email
    zh_Hans: 用于发送邮件
  llm: A tool for sending email
 parameters:
  - name: send_to
    type: string
    required: true
    label:
      en_US: Recipient email account
      zh_Hans: 收件人邮箱账号
    human_description:
      en_US: Recipient email account
      zh_Hans: 收件人邮箱账号
    llm_description: Recipient email account
    form: llm
  - name: subject
    type: string
    required: true
    label:
      en_US: email subject
      zh_Hans: 邮件主题
    human_description:
      en_US: email subject
      zh_Hans: 邮件主题
    llm_description: email subject
    form: llm
  - name: email_content
    type: string
    required: true
    label:
      en_US: email content
      zh_Hans: 邮件内容
    human_description:
      en_US: email content
      zh_Hans: 邮件内容
    llm_description: email content
    form: llm
--- a/api/core/tools/provider/builtin/email/tools/send_mail_batch.py
+++ b/api/core/tools/provider/builtin/email/tools/send_mail_batch.py
@ -0,0 +1,75 @@
 import json
 import re
 from typing import Any, Union
 from core.tools.entities.tool_entities import ToolInvokeMessage
 from core.tools.provider.builtin.email.tools.send import (
    SendEmailToolParameters,
    send_mail,
 )
 from core.tools.tool.builtin_tool import BuiltinTool
 class SendMailTool(BuiltinTool):
    def _invoke(
        self, user_id: str, tool_parameters: dict[str, Any]
    ) -> Union[ToolInvokeMessage, list[ToolInvokeMessage]]:
        """
        invoke tools
        """
        sender = self.runtime.credentials.get("email_account", "")
        email_rgx = re.compile(r"^[a-zA-Z0-9_-]+@[a-zA-Z0-9_-]+(\.[a-zA-Z0-9_-]+)+$")
        password = self.runtime.credentials.get("email_password", "")
        smtp_server = self.runtime.credentials.get("smtp_server", "")
        if not smtp_server:
            return self.create_text_message("please input smtp server")
        smtp_port = self.runtime.credentials.get("smtp_port", "")
        try:
            smtp_port = int(smtp_port)
        except ValueError:
            return self.create_text_message("Invalid parameter smtp_port(should be int)")
        if not sender:
            return self.create_text_message("please input sender")
        if not email_rgx.match(sender):
            return self.create_text_message("Invalid parameter userid, the sender is not a mailbox")
        receivers_email = tool_parameters["send_to"]
        if not receivers_email:
            return self.create_text_message("please input receiver email")
        receivers_email = json.loads(receivers_email)
        for receiver in receivers_email:
            if not email_rgx.match(receiver):
                return self.create_text_message(
                    f"Invalid parameter receiver email, the receiver email({receiver}) is not a mailbox"
                )
        email_content = tool_parameters.get("email_content", "")
        if not email_content:
            return self.create_text_message("please input email content")
        subject = tool_parameters.get("subject", "")
        if not subject:
            return self.create_text_message("please input email subject")
        encrypt_method = self.runtime.credentials.get("encrypt_method", "")
        if not encrypt_method:
            return self.create_text_message("please input encrypt method")
        msg = {}
        for receiver in receivers_email:
            send_email_params = SendEmailToolParameters(
                smtp_server=smtp_server,
                smtp_port=smtp_port,
                email_account=sender,
                email_password=password,
                sender_to=receiver,
                subject=subject,
                email_content=email_content,
                encrypt_method=encrypt_method,
            )
            if send_mail(send_email_params):
                msg[receiver] = "send email success"
            else:
                msg[receiver] = "send email failed"
        return self.create_text_message(json.dumps(msg))
--- a/api/core/tools/provider/builtin/email/tools/send_mail_batch.yaml
+++ b/api/core/tools/provider/builtin/email/tools/send_mail_batch.yaml
@ -0,0 +1,46 @@
 identity:
  name: send_mail_batch
  author: wakaka6
  label:
    en_US: send email to multiple recipients
    zh_Hans: 发送邮件给多个收件人
  icon: icon.svg
 description:
  human:
    en_US: A tool for sending email to multiple recipients
    zh_Hans: 用于发送邮件给多个收件人的工具
  llm: A tool for sending email to multiple recipients
 parameters:
  - name: send_to
    type: string
    required: true
    label:
      en_US: Recipient email account(json list)
      zh_Hans: 收件人邮箱账号(json list)
    human_description:
      en_US: Recipient email account
      zh_Hans: 收件人邮箱账号
    llm_description: A list of recipient email account(json format)
    form: llm
  - name: subject
    type: string
    required: true
    label:
      en_US: email subject
      zh_Hans: 邮件主题
    human_description:
      en_US: email subject
      zh_Hans: 邮件主题
    llm_description: email subject
    form: llm
  - name: email_content
    type: string
    required: true
    label:
      en_US: email content
      zh_Hans: 邮件内容
    human_description:
      en_US: email content
      zh_Hans: 邮件内容
    llm_description: email content
    form: llm
--- a/api/core/tools/provider/builtin/fal/_assets/icon.svg
+++ b/api/core/tools/provider/builtin/fal/_assets/icon.svg
@ -0,0 +1,4 @@
 <?xml version="1.0" encoding="UTF-8"?>
 <svg version="1.1" xmlns="http://www.w3.org/2000/svg" width="32" height="32">
 <path d="M0 0 C3.96 0 7.92 0 12 0 C12.4125 0.928125 12.825 1.85625 13.25 2.8125 C15.56104487 7.02190315 17.49701732 8.49900577 22 10 C22 13.96 22 17.92 22 22 C21.071875 22.4125 20.14375 22.825 19.1875 23.25 C14.97809685 25.56104487 13.50099423 27.49701732 12 32 C8.04 32 4.08 32 0 32 C-0.4125 31.071875 -0.825 30.14375 -1.25 29.1875 C-3.56104487 24.97809685 -5.49701732 23.50099423 -10 22 C-10 18.04 -10 14.08 -10 10 C-9.071875 9.5875 -8.14375 9.175 -7.1875 8.75 C-2.97809685 6.43895513 -1.50099423 4.50298268 0 0 Z M-2 11 C-3.42662219 13.85324437 -3.31033868 15.83454549 -3 19 C-1.20006226 21.69990662 0.083773 23.5418865 3 25 C7.1364408 25.56406011 8.76045933 25.14638597 12.375 22.9375 C15.26054626 20.20817124 15.26054626 20.20817124 15.6875 16.5625 C14.76325283 11.77321919 13.68514918 10.2147046 10 7 C4.54838272 6.02649691 1.87056683 7.12943317 -2 11 Z " fill="#EC0648" transform="translate(10,0)"/>
 </svg>
--- a/api/core/tools/provider/builtin/fal/fal.py
+++ b/api/core/tools/provider/builtin/fal/fal.py
@ -0,0 +1,20 @@
 import requests
 from core.tools.errors import ToolProviderCredentialValidationError
 from core.tools.provider.builtin_tool_provider import BuiltinToolProviderController
 class FalProvider(BuiltinToolProviderController):
    def _validate_credentials(self, credentials: dict) -> None:
        url = "https://fal.run/fal-ai/flux/dev"
        headers = {
            "Authorization": f"Key {credentials.get('fal_api_key')}",
            "Content-Type": "application/json",
        }
        data = {"prompt": "Cat"}
        response = requests.post(url, json=data, headers=headers)
        if response.status_code == 401:
            raise ToolProviderCredentialValidationError("FAL API key is invalid")
        elif response.status_code != 200:
            raise ToolProviderCredentialValidationError(f"FAL API key validation failed: {response.text}")
--- a/api/core/tools/provider/builtin/fal/fal.yaml
+++ b/api/core/tools/provider/builtin/fal/fal.yaml
@ -0,0 +1,21 @@
 identity:
  author: Kalo Chin
  name: fal
  label:
    en_US: FAL
    zh_CN: FAL
  description:
    en_US: The image generation API provided by FAL.
    zh_CN: FAL 提供的图像生成 API。
  icon: icon.svg
  tags:
    - image
 credentials_for_provider:
  fal_api_key:
    type: secret-input
    required: true
    label:
      en_US: FAL API Key
    placeholder:
      en_US: Please input your FAL API key
    url: https://fal.ai/dashboard/keys
--- a/api/core/tools/provider/builtin/fal/tools/flux_1_1_pro.py
+++ b/api/core/tools/provider/builtin/fal/tools/flux_1_1_pro.py
@ -0,0 +1,46 @@
 from typing import Any, Union
 import requests
 from core.tools.entities.tool_entities import ToolInvokeMessage
 from core.tools.tool.builtin_tool import BuiltinTool
 class Flux11ProTool(BuiltinTool):
    def _invoke(
        self, user_id: str, tool_parameters: dict[str, Any]
    ) -> Union[ToolInvokeMessage, list[ToolInvokeMessage]]:
        headers = {
            "Authorization": f"Key {self.runtime.credentials['fal_api_key']}",
            "Content-Type": "application/json",
        }
        prompt = tool_parameters.get("prompt", "")
        sanitized_prompt = prompt.replace("\\", "")  # Remove backslashes from the prompt which may cause errors
        payload = {
            "prompt": sanitized_prompt,
            "image_size": tool_parameters.get("image_size", "landscape_4_3"),
            "seed": tool_parameters.get("seed"),
            "sync_mode": tool_parameters.get("sync_mode", False),
            "num_images": tool_parameters.get("num_images", 1),
            "enable_safety_checker": tool_parameters.get("enable_safety_checker", True),
            "safety_tolerance": tool_parameters.get("safety_tolerance", "2"),
        }
        url = "https://fal.run/fal-ai/flux-pro/v1.1"
        response = requests.post(url, json=payload, headers=headers)
        if response.status_code != 200:
            return self.create_text_message(f"Got Error Response: {response.text}")
        res = response.json()
        result = [self.create_json_message(res)]
        for image_info in res.get("images", []):
            image_url = image_info.get("url")
            if image_url:
                result.append(self.create_image_message(image=image_url, save_as=self.VariableKey.IMAGE.value))
        return result
--- a/api/core/tools/provider/builtin/fal/tools/flux_1_1_pro.yaml
+++ b/api/core/tools/provider/builtin/fal/tools/flux_1_1_pro.yaml
@ -0,0 +1,147 @@
 identity:
  name: flux_1_1_pro
  author: Kalo Chin
  label:
    en_US: FLUX 1.1 [pro]
    zh_Hans: FLUX 1.1 [pro]
  icon: icon.svg
 description:
  human:
    en_US: FLUX 1.1 [pro] is an enhanced version of FLUX.1 [pro], improved image generation capabilities, delivering superior composition, detail, and artistic fidelity compared to its predecessor.
    zh_Hans: FLUX 1.1 [pro] 是 FLUX.1 [pro] 的增强版，改进了图像生成能力，与其前身相比，提供了更出色的构图、细节和艺术保真度。
  llm: This tool generates images from prompts using FAL's FLUX 1.1 [pro] model.
 parameters:
  - name: prompt
    type: string
    required: true
    label:
      en_US: Prompt
      zh_Hans: 提示词
    human_description:
      en_US: The text prompt used to generate the image.
      zh_Hans: 用于生成图片的文字提示词。
    llm_description: This prompt text will be used to generate the image.
    form: llm
  - name: image_size
    type: select
    required: false
    options:
      - value: square_hd
        label:
          en_US: Square HD
          zh_Hans: 方形高清
      - value: square
        label:
          en_US: Square
          zh_Hans: 方形
      - value: portrait_4_3
        label:
          en_US: Portrait 4:3
          zh_Hans: 竖屏 4:3
      - value: portrait_16_9
        label:
          en_US: Portrait 16:9
          zh_Hans: 竖屏 16:9
      - value: landscape_4_3
        label:
          en_US: Landscape 4:3
          zh_Hans: 横屏 4:3
      - value: landscape_16_9
        label:
          en_US: Landscape 16:9
          zh_Hans: 横屏 16:9
    default: landscape_4_3
    label:
      en_US: Image Size
      zh_Hans: 图片大小
    human_description:
      en_US: The size of the generated image.
      zh_Hans: 生成图像的尺寸。
    form: form
  - name: num_images
    type: number
    required: false
    default: 1
    min: 1
    max: 1
    label:
      en_US: Number of Images
      zh_Hans: 图片数量
    human_description:
      en_US: The number of images to generate.
      zh_Hans: 要生成的图片数量。
    form: form
  - name: safety_tolerance
    type: select
    required: false
    options:
      - value: "1"
        label:
          en_US: "1 (Most strict)"
          zh_Hans: "1（最严格）"
      - value: "2"
        label:
          en_US: "2"
          zh_Hans: "2"
      - value: "3"
        label:
          en_US: "3"
          zh_Hans: "3"
      - value: "4"
        label:
          en_US: "4"
          zh_Hans: "4"
      - value: "5"
        label:
          en_US: "5"
          zh_Hans: "5"
      - value: "6"
        label:
          en_US: "6 (Most permissive)"
          zh_Hans: "6（最宽松）"
    default: "2"
    label:
      en_US: Safety Tolerance
      zh_Hans: 安全容忍度
    human_description:
      en_US: The safety tolerance level for the generated image. 1 being the most strict and 6 being the most permissive.
      zh_Hans: 生成图像的安全容忍级别，1 为最严格，6 为最宽松。
    form: form
  - name: seed
    type: number
    required: false
    min: 0
    max: 9999999999
    label:
      en_US: Seed
      zh_Hans: 种子
    human_description:
      en_US: The same seed and prompt can produce similar images.
      zh_Hans: 相同的种子和提示词可以产生相似的图像。
    form: form
  - name: enable_safety_checker
    type: boolean
    required: false
    default: true
    label:
      en_US: Enable Safety Checker
      zh_Hans: 启用安全检查器
    human_description:
      en_US: Enable or disable the safety checker.
      zh_Hans: 启用或禁用安全检查器。
    form: form
  - name: sync_mode
    type: boolean
    required: false
    default: false
    label:
      en_US: Sync Mode
      zh_Hans: 同步模式
    human_description:
      en_US: >
        If set to true, the function will wait for the image to be generated and uploaded before returning the response.
        This will increase the latency but allows you to get the image directly in the response without going through the CDN.
      zh_Hans: >
        如果设置为 true，函数将在生成并上传图像后再返回响应。
        这将增加函数的延迟，但可以让您直接在响应中获取图像，而无需通过 CDN。
    form: form
--- a/Show More
+++ b/Show More
Author	SHA1	Message	Date
Bowen Liang	814c9fab1c	update	2024-11-15 14:59:20 +08:00
Bowen Liang	3e1be72db8	update	2024-11-15 14:59:20 +08:00
Bowen Liang	fc37240b4f	update	2024-11-15 14:59:20 +08:00
Bowen Liang	71029e4d6c	update logs	2024-11-15 14:59:20 +08:00
Bowen Liang	b136e7821b	update doc	2024-11-15 14:59:20 +08:00
Bowen Liang	c3f1b9978a	fix UP042 by using StrEnum	2024-11-15 14:59:20 +08:00
Bowen Liang	fec99fcc5e	auto fixes by ruff	2024-11-15 14:59:20 +08:00
Bowen Liang	d9216b686f	update	2024-11-15 14:59:20 +08:00
Bowen Liang	4c151e1c25	update api doc	2024-11-15 14:59:16 +08:00
非法操作	2a5c5a4e15	fix: remove default model selection for audio tool (#10729 )	2024-11-15 12:40:41 +08:00
非法操作	4b2abf8ac2	fix: create_blob_message of tool will always create image type file (#10701 ) Some checks are pending Build and Push API & Web / build (api, DIFY_API_IMAGE_NAME, linux/amd64, build-api-amd64) (push) Waiting to run Details Build and Push API & Web / build (api, DIFY_API_IMAGE_NAME, linux/arm64, build-api-arm64) (push) Waiting to run Details Build and Push API & Web / build (web, DIFY_WEB_IMAGE_NAME, linux/amd64, build-web-amd64) (push) Waiting to run Details Build and Push API & Web / build (web, DIFY_WEB_IMAGE_NAME, linux/arm64, build-web-arm64) (push) Waiting to run Details Build and Push API & Web / create-manifest (api, DIFY_API_IMAGE_NAME, merge-api-images) (push) Blocked by required conditions Details Build and Push API & Web / create-manifest (web, DIFY_WEB_IMAGE_NAME, merge-web-images) (push) Blocked by required conditions Details	2024-11-15 10:38:12 +08:00
Bowen Liang	365cb4b368	chore(lint): bump ruff from 0.6.9 to 0.7.3 (#10714 )	2024-11-15 09:19:41 +08:00
GeorgeCaoJ	c85bff235d	fix(i18n): handle key naming error (#10713 )	2024-11-15 09:01:38 +08:00
Kalo Chin	ad16180b1a	feat(tool): fal ai wizper ASR built-in tool (#10716 )	2024-11-15 09:01:07 +08:00
jarvis2f	5ff02b469f	fix:position error when creating segments (#10706 ) Some checks are pending Build and Push API & Web / build (api, DIFY_API_IMAGE_NAME, linux/amd64, build-api-amd64) (push) Waiting to run Details Build and Push API & Web / build (api, DIFY_API_IMAGE_NAME, linux/arm64, build-api-arm64) (push) Waiting to run Details Build and Push API & Web / build (web, DIFY_WEB_IMAGE_NAME, linux/amd64, build-web-amd64) (push) Waiting to run Details Build and Push API & Web / build (web, DIFY_WEB_IMAGE_NAME, linux/arm64, build-web-arm64) (push) Waiting to run Details Build and Push API & Web / create-manifest (api, DIFY_API_IMAGE_NAME, merge-api-images) (push) Blocked by required conditions Details Build and Push API & Web / create-manifest (web, DIFY_WEB_IMAGE_NAME, merge-web-images) (push) Blocked by required conditions Details	2024-11-14 21:25:15 +08:00
Bowen Liang	44f57ad9a8	chore: Bump Alpine Linux to 3.20 in web dockerfile (#10671 )	2024-11-14 20:57:01 +08:00
yihong	94fd6f6901	fix: typo in test (#10707 ) Signed-off-by: yihong0618 <zouzou0208@gmail.com>	2024-11-14 20:54:13 +08:00
SiliconFlow, Inc	e61242a337	feat: add vlm models from siliconflow (#10704 )	2024-11-14 20:53:35 +08:00
yihong	722964667f	fix: non utf8 code decode close #10691 (#10698 ) Signed-off-by: yihong0618 <zouzou0208@gmail.com>	2024-11-14 17:29:49 +08:00
Xiao Ley	fbb9c1c249	fixed the Base URL usage issue in Podcast Generator tool verification (#10697 )	2024-11-14 17:24:42 +08:00
非法操作	15f341b655	feat: add the audio tool (#10695 )	2024-11-14 16:37:15 +08:00
crazywoola	b358490607	chore: update issue template (#10693 )	2024-11-14 16:12:27 +08:00
crazywoola	f9e4196fd5	Update pull_request_template.md (#10692 )	2024-11-14 15:56:37 +08:00
crazywoola	751525802d	feat: update pr template (#10690 )	2024-11-14 15:52:15 +08:00
lz	2abacd2a2d	export configuration 'CODE_EXECUTION_TIMEOUT' to .env (#10688 ) Co-authored-by: liuzhu <liuzhu@fridaycloud.com.cn>	2024-11-14 15:34:34 +08:00
Nam Vu	a3155e0613	Update expat version (#10686 )	2024-11-14 15:30:55 +08:00
Jyong	70b9e4caf5	check dataset is none (#10682 )	2024-11-14 14:07:19 +08:00
orangeclk	317ae9233e	feat: add json response format for siliconflow models (#10657 ) Some checks are pending Build and Push API & Web / build (api, DIFY_API_IMAGE_NAME, linux/amd64, build-api-amd64) (push) Waiting to run Details Build and Push API & Web / build (api, DIFY_API_IMAGE_NAME, linux/arm64, build-api-arm64) (push) Waiting to run Details Build and Push API & Web / build (web, DIFY_WEB_IMAGE_NAME, linux/amd64, build-web-amd64) (push) Waiting to run Details Build and Push API & Web / build (web, DIFY_WEB_IMAGE_NAME, linux/arm64, build-web-arm64) (push) Waiting to run Details Build and Push API & Web / create-manifest (api, DIFY_API_IMAGE_NAME, merge-api-images) (push) Blocked by required conditions Details Build and Push API & Web / create-manifest (web, DIFY_WEB_IMAGE_NAME, merge-web-images) (push) Blocked by required conditions Details	2024-11-14 08:58:22 +08:00
xiandan-erizo	5b8f03cd9d	add abab7-chat-preview model (#10654 ) Some checks failed Build and Push API & Web / build (api, DIFY_API_IMAGE_NAME, linux/amd64, build-api-amd64) (push) Waiting to run Details Build and Push API & Web / build (api, DIFY_API_IMAGE_NAME, linux/arm64, build-api-arm64) (push) Waiting to run Details Build and Push API & Web / build (web, DIFY_WEB_IMAGE_NAME, linux/amd64, build-web-amd64) (push) Waiting to run Details Build and Push API & Web / build (web, DIFY_WEB_IMAGE_NAME, linux/arm64, build-web-arm64) (push) Waiting to run Details Build and Push API & Web / create-manifest (api, DIFY_API_IMAGE_NAME, merge-api-images) (push) Blocked by required conditions Details Build and Push API & Web / create-manifest (web, DIFY_WEB_IMAGE_NAME, merge-web-images) (push) Blocked by required conditions Details Mark stale issues and pull requests / stale (push) Has been cancelled Details Co-authored-by: xiandan-erizo <xiandan-erizo@outlook.com>	2024-11-13 19:30:42 +08:00
Kalo Chin	2a4783307a	Feat(tool): fal ai flux image generation (#10606 )	2024-11-13 17:41:58 +08:00
非法操作	bddecba9ed	fix: `mp3` file upload not work (#10650 )	2024-11-13 17:37:29 +08:00
jiangbo721	931e76e3d1	fix: remove unused queue `generation` (#10532 ) Co-authored-by: 刘江波 <jiangbo721@163.com>	2024-11-13 15:52:52 +08:00
-LAN-	70c2ec8ed5	feat(variable-handling): enhance variable and segment conversion (#10483 ) Some checks are pending Build and Push API & Web / build (api, DIFY_API_IMAGE_NAME, linux/amd64, build-api-amd64) (push) Waiting to run Details Build and Push API & Web / build (api, DIFY_API_IMAGE_NAME, linux/arm64, build-api-arm64) (push) Waiting to run Details Build and Push API & Web / build (web, DIFY_WEB_IMAGE_NAME, linux/amd64, build-web-amd64) (push) Waiting to run Details Build and Push API & Web / build (web, DIFY_WEB_IMAGE_NAME, linux/arm64, build-web-arm64) (push) Waiting to run Details Build and Push API & Web / create-manifest (api, DIFY_API_IMAGE_NAME, merge-api-images) (push) Blocked by required conditions Details Build and Push API & Web / create-manifest (web, DIFY_WEB_IMAGE_NAME, merge-web-images) (push) Blocked by required conditions Details	2024-11-12 21:51:09 +08:00
wakaka6	9c7edb9242	feat: add builtin tools for send email (#10493 )	2024-11-12 21:48:36 +08:00
Benjamin	0867821ae7	fix: update conversation session naming and API path in documentation (#10589 )	2024-11-12 21:44:04 +08:00
Jyong	0b2d51d859	add the index field for elasticsearch (#10592 )	2024-11-12 21:43:16 +08:00
方程	ef8022f715	Gitee AI Qwen2.5-72B model (#10595 )	2024-11-12 21:40:32 +08:00
Kevin9703	e03ec0032b	fix: Azure OpenAI o1 max_completion_token error (#10593 )	2024-11-12 21:40:13 +08:00