Data loading
🗃️ Loading overview
5 items
🗃️ Load from Object Storage
8 items
📄️ Load data from a local file system
How to load data from a local file system into StarRocks using Stream Load (an HTTP-based method typically invoked with curl), with support for CSV, JSON, Parquet, and ORC formats.
📄️ Load data from HDFS
How to load data from HDFS into StarRocks using INSERT+FILES(), Broker Load, or Spark Load.
🗃️ Load data from Apache Kafka
2 items
🗃️ Load data from Apache Spark
2 items
📄️ Load data using INSERT
How to load data into StarRocks using INSERT statements: INSERT VALUES, INSERT INTO SELECT, INSERT OVERWRITE, Dynamic Overwrite, and loading from files using FILES().
📄️ SQL Transaction
How to use SQL transactions in StarRocks to commit multiple DML statements (INSERT, UPDATE, DELETE) atomically with ACID guarantees across multiple tables.
📄️ Load data using Stream Load transaction interface
How to use the Stream Load transaction interface (two-phase commit) in StarRocks for highly concurrent stream loads from Flink or Kafka with multi-table transaction support.
📄️ Realtime synchronization from MySQL
How to use Flink CDC to capture MySQL change data in real time and synchronize it into StarRocks tables using the StarRocks Flink connector.
📄️ Continuously load data from Apache Flink®
How to use the StarRocks Flink connector to continuously load data from Apache Flink into StarRocks, supporting DataStream API, Table API & SQL, and Python API.
📄️ Continuously load data from Apache® Pulsar™
How to use Routine Load to continuously load data from Apache Pulsar topics into StarRocks, with CSV and JSON format support.
📄️ Change data through loading
How to perform INSERT, UPDATE, DELETE, partial update, and conditional update on Primary Key tables in StarRocks using Stream Load, Broker Load, or Routine Load.
📄️ Transform data at loading
How to transform data during loading in StarRocks: column mapping, filter conditions, and derived column expressions for Stream Load, Broker Load, and Routine Load.
📄️ Load data using tools
Overview of third-party tools for loading data into StarRocks: SMT, DataX, dbt, CloudCanal, and other ecosystem integrations.
📄️ Strict mode
How strict mode affects data loading in StarRocks: what it controls, how to enable it, and when to use it to reject rows with invalid column type conversions.