Skip to main content
Version: 2.2

Create a metadata migration

Metadata migrations transfer existing metadata, as well as any subsequent changes made to the source metadata (in the scope of the migration), while Hive Migrator keeps working.

caution

If you're using MariaDB or MySQL, add the JDBC driver to the classpath manually.

note

Wildcards support
When creating patterns for your migration, note the following:

  • If using Hive 1, 2, or 3: Only use patterns with the wildcards * and |.

    For example, using --database-pattern test* will match any database with "test" at the beginning of its name, such as test01, test02, test03.

  • If using Hive 4: Use any wildcards based on Hive's Data Definition Language (DDL).

Create a metadata migration with the UI

info

Ensure you have migrated the data for the databases and tables you want to migrate.

You need both the data and associated metadata before you can successfully run queries on migrated databases.

  1. From your Dashboard, select the product under Products.

  2. Under Metadata Migrations, select Create metadata migration.

  3. Enter a name for this migration.

  4. Select a source and target agent.

  5. Create a Database Pattern and Table Pattern that match the databases and tables you want to migrate.

  6. (Optional) - Use the options under Target Agent Configuration Overrides to override your Databricks target agent configuration for this migration.

    • Catalog Enter the name of your Databricks Unity Catalog.

    • Convert to delta format Select to convert your tables to Delta Lake format after migrating to Databricks.

    • Delete after conversion Select to delete the underlying table data and metadata from the Filesystem Mount Point location after it has been converted to Delta Lake in Databricks.

      info

      Only use this option if you're performing one-time migrations for the underlying table data. The Databricks agent doesn't support continuous (live) updates of table data if you're converting to Delta Lake in Databricks.

    • Filesysytem Mount Point The filesystem that contains your data you want to migrate must be mounted onto your DBFS.
      Enter the mounted container's path on the DBFS.

    • (Optional) - Enter another path for Default Filesystem Override.

      1. If you select Convert to Delta Lake , enter the location on the DBFS to store the tables converted to Delta Lake. To store Delta Lake tables on cloud storage, enter the path to the mount point and the path on the cloud storage.

        Example: Location on the DBFS to store tables converted to Delta Lake
        dbfs:<location>
        Example: Cloud storage location
        dbfs:/mnt/adls2/storage_account/
      2. If you don't select Convert to Delta, enter the mount point.

        Example: Filesystem mount point
        dbfs:<value of Fs mount point>
  7. Select Start migration automatically to start the migration automatically or leave clear to start manually after creation.

  8. Select Create to create the metadata migration.

info

Metrics shown on the metadata migration content summary page don’t show correct results following certain metadata migration failures. See the Known issue for more information.