Empowering One-Stop Construction of the Big Data Ecosystem

SDC Hadoop Data Storage and Computing Software

SDC Hadoop Data Storage and Computing Software is a fundamental big data tool based on distributed technology. It provides users with massive data storage, computing, and analysis capabilities.

Capabilities & Features

Dynamic Classification Unified Storage

1、Supports unified storage of structured, semi-structured, and unstructured data. 2、Supports policy-based storage configuration for dynamic classification storage of small,medium, and large files. 3、Supports custom configuration of storage archiving policies for rapid archiving management and intelligent retrieval of data assets.

Multi-source Heterogeneous Fusion Computing

1、Supports various data sources, including relational databases, distributed databases, NoSQL databases, file systems, message queues, etc. 2、Supports graphical task design and orchestration for data mart topic models and data fusion. 3、Based on the multi-source heterogeneous fusion computing engine, supports intelligent perception of materialized views. 4、Supports various types of data operations and processing functions, such as relational operators, arithmetic operators, string processing, etc. 5、Supports standalone and distributed deployment modes.

Enterprise-Grade Intelligent Data Warehouse

1、Provides a unified SQL parsing layer for access and operations, supporting P/LSQL, TSQL syntax; compatible with SQL2003, SQL99, SQL92 standards; supports Oracle, SQL Server dialects and stored procedures. 2、Supports wizard-based construction of enterprise data warehouses, providing graphical design for data warehouse layering. 3、To facilitate rapid data warehouse construction, supports online SQL IDE for SQL script debugging and data preview. 4、Supports distributed transaction mechanisms and batch CRUD (Create, Read, Update,Delete) operations to ensure data operation ACID compliance.

Visual Orchestration for Real-time Computing

1、Supports visual orchestration of real-time computing tasks, providing various data stream processing operators. 2、Supports micro-batch event processing and event-driven processing modes for stream computing. Also supports CEP (Complex Event Processing). 3、Supports configuration, monitoring, and management of stream-based machine learning,user-defined functions (UDFs), and real-time computing tasks.

Self-Service Intelligent Search

1、Supports online preview and data download for structured and unstructured data; search performance for billions of records achieves millisecond-level response. 2、Supports multi-tiered storage (memory, SSD, SATA/SAS disks) and SSD hardware acceleration for hot, warm, and cold data. 3、Supports standard SQL for multi-dimensional combined exact queries, fuzzy queries,keyword, phrase, and wildcard rapid matching. 4、For building enterprise knowledge bases, supports rapid data archiving and full-text retrieval based on the unified data storage service.

Competitive Edge

High-Performance Intelligent Adaptive Computing Engine

Combining data distribution characteristics, it achieves balanced data loads across computing nodes by statistically analyzing execution information during computation and dynamically adjusting data shards. This results in computational capabilities several times higher than the open-source version in distributed computing scenarios, effectively enhancing data processing power. In multi-task parallel environments, it provides faster distributed computing speeds. For ultra-large cluster scales, it ensures more stable cluster operation.

Unified Access and Operation SQL Engine

The scenario-based SDC SQL engine avoids the data scenario limitations caused by the tight coupling of SQL engines, computing engines, and storage engines. By decoupling the SQL engine from the computing engine, it broadly supports numerous data computing scenarios,including interactive query analysis, OLAP, OLTP, data retrieval, real-time computing, and graph analysis. Concurrently, the SDC SQL engine supports standard SQL 99, Oracle, SQL Server dialects, and stored procedures, providing users with a unified SQL parsing layer for data access and operations. To adapt to computational tuning for different data tasks, the SDC SQL engine supports tuning configurations based on sessions or individual SQL statements, offering a more flexible configuration model for user task execution.

Fusion Computing for Multi-source Heterogeneous Data

Multi-source heterogeneous data fusion computing, as a next-generation fusion computing engine, redefines the traditional data processing model for offline data fusion computing. Based on an intelligent SQL engine, it extends SQL syntax. Leveraging the distributed fusion computing engine, it supports online fusion calculations across different data source types and data types.Furthermore, to meet the data modeling needs of data warehouses and data marts, it supports intelligent perception and scheduling of materialized views based on the distributed fusion computing engine, providing a graphical drag-and-drop tool for rapid, batch construction of data models.

Intelligent Planning for Large-Scale Clusters

To reduce the complexity and deployment costs associated with planning layered, multi-domain large-scale clusters, it provides intelligent planning and one-click cluster installation/deployment based on service groups, roles, services, and node loads. Simultaneously, to meet the needs of unified operation, maintenance, and security control, it offers visual O&M management, unified log management, a web-based command-line shell tool, Kerberos authentication, KMS data encryption, and an RBAC user permission management model. It supports fine-grained data access control to ensure the system and data are protected from malicious attacks and security threats.

Application Scenario

Government Sector Application Scenarios
To support the construction of fundamental public data resource libraries and government big data demonstration applications, and to help governments efficiently compute and retrieve existing data, SDC Hadoop provides distributed file storage technology, computing frameworks,and search engines. This enables distributed execution of computing tasks, deep real-time data search, and multi-dimensional data presentation.
Coal Industry Application Scenarios
To implement national decisions and deployments, and promote the healthy development of the coal big data industry, Sifang Weiye proposes a big data solution for coal regulatory information: Leveraging big data technology to mine massive data, explore patterns, accurately predict industry trends, ensure production safety, and facilitate industry transformation, upgrading, and healthy sustainable development.

User Benefits

Distributed Computing for Massive Data

Provides one-stop big data products, technical services, and solution support capabilities for various data analysis and computing scenarios.

Unified Storage for All Data Types

Helps government and industry users efficiently store, archive, and manage various types of data, providing a unified data storage service.

Application Scenarios

Large-Screen Visualization System for Network Control Center of Urumqi Rail Transit Industry Headquarters

The Rail Transit Command Center serves as the central coordination hub for the entire urban rail network, responsible for synchronizing operations between line control centers and operating entities. Its functions encompass comprehensive monitoring, multi-line and multi-system operational coordination, emergency command, and information sharing. In routine scenarios, it undertakes coordination and assistance tasks; in abnormal situations, it provides robust emergency command capabilities.


Learn more
National Grain Consultation Center Visualization

The State Administration of Grain initiated the construction of the National Grain Management Platform to standardize daily grain depot operations and business processes, enhance information transparency and centralization, and enable administrative authorities to remotely monitor grain reserves, stock conditions, quality, production safety, market trends, and depot operations nationwide without leaving their offices.


Learn more
A City Airport Visualization Platform

Develop an integrated visualization and monitoring platform tailored to the airport group’s characteristics. Leveraging big data, IoT sensing, virtual simulation, and digital twin technologies, the platform enables intelligent, precise, and scientific management of personnel, aircraft, and equipment. This enhances operational efficiency, management levels, and decision-making capabilities to support comprehensive airport control.

Learn more

Begin Your Data Intelligence Journey Today

Consultation Free Trial