Lakehouse AI Overview

Singdata Lakehouse AI is an intelligent analytics suite integrated within the data lakehouse platform, designed to help enterprises fully unlock data value and achieve a complete closed loop from data storage to intelligent decision-making. By natively integrating capabilities such as unstructured data management, multimodal retrieval, AI external functions, Python development framework, knowledge base services, and conversational analytics into the Lakehouse architecture, users can perform end-to-end intelligent operations including data discovery, model invocation, and predictive analytics on a unified platform.

  • Unstructured Data Discovery and Management: Automatically identify and index unstructured data such as documents, images, audio, and video, enabling unified metadata management and intelligent classification
  • AI External Functions: Encapsulate pre-trained models as SQL functions, enabling direct AI capability invocation within queries to simplify machine learning application development
  • Multimodal Retrieval: Support unified search across multiple data types including text, images, and vectors, enabling cross-modal intelligent semantic retrieval
  • AI + BI Unified Workflow: Seamlessly integrate traditional data processing workflows with AI data application workflows on the Lakehouse platform, achieving true data-intelligence convergence.
  • Python Language Interface (Zettapark): Provide a Python-like development framework, enabling data engineers to perform big data processing using their familiar language.
  • Knowledge Base Service (Beta): Provide an enterprise knowledge management platform based on vector databases, supporting document parsing, semantic search, and intelligent Q&A APIs.
  • Lakehouse Conversational Analytics DataGPT: Complete data query and analysis through natural language conversations, enabling business users to leverage data with zero barriers.
  • Lakehouse MCP Server (Beta): An instruction-as-a-service feature integrated with the AI ecosystem, enabling AI assistants to directly operate Lakehouse data resources.