How to Migrate Your Oracle Database to AWS RDS Using S3

Hello Friends,

Tired of maintaining bulky on-premises Oracle servers? The cloud promises agility and scale, but migrating mission-critical databases can feel like a high-wire act. What if you had a secure, proven path to the cloud?

If you're leading a modernization charge, you've likely hit this crossroads. Your Oracle databases are the backbone of your operations, and any migration must be fast, secure, and reliable. The good news? You don't have to reinvent the wheel.

In this guide, we’ll walk through the industry-favored method for large-scale Oracle migrations: using Oracle Data Pump integrated with Amazon S3. This isn't just a theoretical overview—it's a practical, step-by-step blueprint to get your data safely to Amazon RDS.

Why Choose This Migration Path? Your Options Explained

Before we dive into the "how," let's quickly review the "why." When moving from on-premises Oracle to AWS, you have several tools at your disposal:

Oracle SQL Developer: Great for small, simple databases (think under 50 MB). It's a handy UI-driven tool but not cut out for heavy lifting.

AWS Database Migration Service (DMS): The go-to for heterogeneous migrations (e.g., Oracle to PostgreSQL) or when you need continuous, real-time replication with minimal downtime.

Oracle Data Pump with Amazon S3: This is the champion for large, complex Oracle databases—from hundreds of megabytes to terabytes. It offers fine-grained control, high performance through parallelism, and is a native Oracle tool, making it a trusted choice for mission-critical data.

So, when should you use the Data Pump & S3 method?

You're dealing with a large dataset.

You need parallel processing for speed.

You require precise control over schemas and tablespaces.

The Game Plan: Your Migration Workflow in a Nutshell

Think of this migration as a well-orchestrated relay race. Each step passes the baton cleanly to the next. Here’s the high-level view:

Export: Create a snapshot of your schema from the on-premises database using Data Pump.

Transfer: Upload the snapshot to an Amazon S3 bucket—your secure cloud staging area.

Prepare: Set up the target environment on Amazon RDS for Oracle.

Import: Pull the data from S3 and import it into RDS using Data Pump's powerful API.

Validate: Confirm that everything moved correctly and is working as expected.

[Workflow diagram: On-Prem Oracle DB -> Export via expdp -> Amazon S3 Bucket -> Amazon RDS for Oracle -> Validation Check]

Now, let's get our hands dirty.

Prerequisites: Laying the Groundwork

A smooth migration starts with proper preparation. Before running a single command, ensure you have:

An Amazon S3 Bucket: Created in the exact same AWS Region as your future RDS instance. This minimizes latency and costs.

IAM Roles for S3 Access: Your RDS instance needs permission to read from S3. This involves creating an IAM policy that allows the required S3 read/write actions, attaching it to an IAM role, associating that role with your RDS instance, and enabling the S3_INTEGRATION option in the instance's option group.

Storage Check: Verify you have enough disk space on your source database for the dump file and on your RDS instance for the imported data (a quick size check is sketched below).
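For the storage check, a rough size query against the source schema gives you a lower bound for both the dump file and the space you'll need on RDS. A minimal sketch, assuming the schema being migrated is SULTAN (the example used throughout this guide):

-- Approximate size of the schema to be exported (run on the source database)
SELECT owner,
       ROUND(SUM(bytes) / 1024 / 1024 / 1024, 2) AS size_gb
  FROM dba_segments
 WHERE owner = 'SULTAN'
 GROUP BY owner;

The dump file is usually smaller than the allocated segments, but it's safer to provision with headroom.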

The Migration, Step-by-Step

Step 1: Export the Schema from Your On-Premises Database

We start by creating a consistent snapshot of the schema we want to move. Oracle Data Pump's expdp utility is perfect for this.

Pro Tip: Using FLASHBACK_TIME=SYSTIMESTAMP is like taking a perfectly timed photograph of your database. It ensures a transactionally consistent snapshot, so you don't have to worry about changes during the export.


expdp system/password DIRECTORY=DATA_PUMP_DIR \
  DUMPFILE=Sultan_onprem.dmp \
  LOGFILE=expdp_Sultan_onprem.log \
  SCHEMAS=Sultan \
  FLASHBACK_TIME=SYSTIMESTAMP

SCHEMAS=Sultan: Exports only the Sultan schema, reducing migration time and complexity.

DIRECTORY=DATA_PUMP_DIR: Points to the directory object where the dump file will be created.
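DATA_PUMP_DIR is a logical directory object, so the dump file lands wherever that object points on the source server. If you need the physical path (to grab the file for upload in the next steps), the standard DBA_DIRECTORIES view has it:

-- Find the OS path backing the DATA_PUMP_DIR directory object on the source
SELECT directory_name, directory_path
  FROM dba_directories
 WHERE directory_name = 'DATA_PUMP_DIR';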

Step 2: Prepare the Landing Zone in Amazon RDS

Amazon RDS is a managed service, which means you don't have OS-level access or SYSDBA privileges. You must pre-create the schema user and tablespace manually.

Think of this as preparing a new home for your data before the moving truck arrives.


CREATE USER Sultan IDENTIFIED BY "Sultan12345" DEFAULT TABLESPACE users;
GRANT CONNECT, RESOURCE, UNLIMITED TABLESPACE TO Sultan;
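If you'd rather keep the original tablespace name instead of remapping it during the import (Step 4), RDS for Oracle can create tablespaces for you using Oracle Managed Files; you specify a size rather than a datafile path. A minimal sketch, assuming the on-prem tablespace is named APP_TBS and sizes appropriate for your data:

CREATE TABLESPACE app_tbs DATAFILE SIZE 5G AUTOEXTEND ON MAXSIZE 50G;
ALTER USER Sultan DEFAULT TABLESPACE app_tbs;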

Step 3: The Handoff - Transfer the Dump File via S3

This is where the magic of AWS integration shines. First, upload the .dmp file to your S3 bucket using the AWS CLI or console.

Then, you'll pull the file directly from S3 into the RDS instance's DATA_PUMP_DIR. This bypasses the need for OS access, which RDS wisely restricts.


SELECT rdsadmin.rdsadmin_s3_tasks.download_from_s3(
    p_bucket_name => 'your-migration-bucket',
    p_s3_prefix => 'Sultan_onprem.dmp',
    p_directory_name => 'DATA_PUMP_DIR'
) AS task_id FROM dual;

You can check the status of this download task by querying the RDS logs, ensuring the file transferred completely before moving on.
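Two quick checks are handy here; both are sketches using the rdsadmin.rds_file_util package that RDS provides. The first confirms the dump file actually landed in DATA_PUMP_DIR, and the second reads the transfer task's log (replace <task_id> with the ID returned by the query above):

-- List the contents of DATA_PUMP_DIR on the RDS instance
SELECT filename, filesize, mtime
  FROM table(rdsadmin.rds_file_util.listdir(p_directory => 'DATA_PUMP_DIR'))
 ORDER BY mtime;

-- Read the S3 download task's log from the BDUMP directory
SELECT text
  FROM table(rdsadmin.rds_file_util.read_text_file('BDUMP', 'dbtask-<task_id>.log'));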

Step 4: The Final Sprint - Import the Schema into RDS

Now for the main event. We use the DBMS_DATAPUMP PL/SQL API within RDS to import the schema. This method gives us maximum control.


DECLARE
    v_hdl NUMBER;
BEGIN
    -- Open a Data Pump job for a SCHEMA-mode import
    v_hdl := DBMS_DATAPUMP.OPEN(operation => 'IMPORT', job_mode => 'SCHEMA');

    -- Specify the dump file and log file (named parameters, because filetype
    -- is not the fourth positional argument of ADD_FILE)
    DBMS_DATAPUMP.ADD_FILE(handle => v_hdl, filename => 'Sultan_onprem.dmp',
                           directory => 'DATA_PUMP_DIR',
                           filetype  => DBMS_DATAPUMP.KU$_FILE_TYPE_DUMP_FILE);
    DBMS_DATAPUMP.ADD_FILE(handle => v_hdl, filename => 'impdp_Sultan_onprem.log',
                           directory => 'DATA_PUMP_DIR',
                           filetype  => DBMS_DATAPUMP.KU$_FILE_TYPE_LOG_FILE);

    -- Filter to only import the SULTAN schema (unquoted identifiers are stored in uppercase)
    DBMS_DATAPUMP.METADATA_FILTER(v_hdl, 'SCHEMA_EXPR', 'IN (''SULTAN'')');

    -- Remap the on-prem tablespace to the one available in RDS
    DBMS_DATAPUMP.METADATA_REMAP(v_hdl, 'REMAP_TABLESPACE', 'APP_TBS', 'USERS');

    -- Speed things up with parallelism
    DBMS_DATAPUMP.SET_PARALLEL(v_hdl, 2);

    -- Start the job!
    DBMS_DATAPUMP.START_JOB(v_hdl);
END;
/

Breaking Down the Key Commands:

METADATA_FILTER: This is your precision tool, ensuring only the Sultan schema is imported.

METADATA_REMAP: This is crucial! It redirects database objects from your on-premises tablespace (e.g., APP_TBS) to one that exists on the RDS instance (USERS in this example).

SET_PARALLEL: This turns your single-threaded import into a multi-lane highway, dramatically speeding up the process for large datasets.
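One practical note: START_JOB kicks off the job and the import continues in background worker processes, so the PL/SQL block returning doesn't mean the import has finished. A simple way to keep an eye on it is the standard DBA_DATAPUMP_JOBS view (a quick sketch):

-- Shows the job's state (EXECUTING, COMPLETING, NOT RUNNING, ...) and degree of parallelism
SELECT owner_name, job_name, operation, job_mode, state, degree
  FROM dba_datapump_jobs;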

Step 5: Don't Skip This! Validate the Import

A migration isn't successful until you've verified it. Never skip the validation step.

Check the Logs: Always read the import log for errors.


SELECT text FROM table(rdsadmin.rds_file_util.read_text_file('DATA_PUMP_DIR','impdp_Sultan_onprem.log'));

Verify Objects: Confirm the user and all database objects (tables, procedures, etc.) are present and accounted for.

SELECT * FROM dba_users WHERE username='SULTAN';
SELECT object_name, object_type, status FROM dba_objects WHERE owner='SULTAN';

(Oracle stores unquoted identifiers in uppercase, so the user created as Sultan appears as SULTAN in the dictionary views.)
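Beyond spot checks, it's worth comparing object counts per type against the source and recompiling anything that arrived in an INVALID state. A minimal sketch, again assuming the SULTAN schema:

-- Object counts per type; run the same query on the source and compare the results
SELECT object_type, COUNT(*) AS object_count
  FROM dba_objects
 WHERE owner = 'SULTAN'
 GROUP BY object_type
 ORDER BY object_type;

-- Recompile any PL/SQL objects left invalid by the import
BEGIN
    UTL_RECOMP.RECOMP_SERIAL('SULTAN');
END;
/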

Key Takeaways for a Flawless Migration

Plan and Test: Run through this process in a non-production environment first. Test your network, your S3 permissions, and your storage estimates.

Embrace Parallelism: Using SET_PARALLEL is one of the easiest ways to cut down your migration window.

Remap Tablespaces: RDS for Oracle manages datafiles for you, so you can't reproduce your on-prem datafile layout. Either pre-create the tablespaces you need on the target or remap them during the import; when the source tablespace doesn't exist on RDS, remapping is not a suggestion, it's a requirement.

S3 is Your Best Friend: It provides a secure, scalable, and high-throughput bridge between your on-premises world and your new cloud home.
