Get started with a free trial today
Already have an account? Sign in
A high-level look at Stitch's Amazon MySQL RDS (v1) integration, including release status, useful links, and the features supported in Stitch.
| STITCH | |||
| Release status | Deprecated on November 8, 2021 | Supported by | |
| Stitch plan | Standard | Supported versions | n/a |
| API availability | Not available | Singer GitHub repository | |
| CONNECTION METHODS | |||
| SSH connections | Supported | SSL connections | Supported |
| REPLICATION SETTINGS | |||
| Anchor Scheduling | Supported | Advanced Scheduling | Supported |
| Table-level reset | Supported | Configurable Replication Methods | Supported |
| REPLICATION METHODS | |||
| Log-based Replication | Supported | Key-based Replication | Supported |
| Full Table Replication | Supported | ||
| DATA SELECTION | |||
| Table selection | Supported | Column selection | Supported |
| View replication | Supported | Select all | Supported, with prerequisites |
| TRANSPARENCY | |||
| Extraction Logs | Supported | Loading Reports | Supported |
To set up Amazon MySQL RDS in Stitch, you need:
Privileges in Amazon Web Services (AWS) that allow you to:
TheCREATE USER orINSERT privilege (for themysql database). TheCREATE USER privilege is required to create a database user for Stitch.
TheGRANT OPTION privilege in Amazon MySQL RDS. TheGRANT OPTION privilege is required to grant the necessary privileges to the Stitch database user.
First, you’ll log into Stitch and verify the data pipeline region your account is using. Later in this guide, you’ll need to grant Stitch access by whitelisting our IP addresses.
The IP addresses you’ll whitelist depend on theData pipeline region your account is in.
Locate the list of IP addresses for your region:
Keep this list handy - you’ll need it later.
In this step, you’ll configure the database server to allow traffic from Stitch to access it. There are two ways to connect your database:
Click the option you’re using below and follow the instructions.
For Stitch to successfully connect with your database instance, you’ll need to add our IP addresses to the appropriate Security Group via the AWS management console.
Security Groups must reside in the same VPC as the instance. Use the instructions below to create a security group for Stitch and grant access to the VPC.
StitchCustom TCP Rule3306 by default)Repeat steps 6-8 until all the IP addresses for your Stitch data pipeline region have been added.
This is what a Security Group using Stitch’s North America IP addresses looks like:

Important:Requirements for configuring binlog replication
To use binlog replication, your Amazon MySQL RDS database must be running MySQL version 5.6.2 or greater.
Additionally, setting up binlog replication requires rebooting your database to ensure parameter changes take effect. To minimize disruptions, we recommend performing the reboot during non-peak usage hours.
While Log-based Incremental Replication is the most accurate and efficient method of replication, using this replication method may, at times, require manual intervention or impact the source database’s performance. Refer to theLog-based Incremental Replication documentation for more info.
You can also use one of Stitch’s otherReplication Methods, which don’t require any database configuration. Replication Methods can be changed at any time.
In this step, you’ll configure the database parameters required to use Log-based Incremental Replication.
Locate the parameters in the list below, and enter the required values into theValues column:
binlog_format:ROW
binlog_row_image:FULL
log_slave_updates:1
Note:log_slave_updates is applicable only if you are connecting a read replica to Stitch. If you aren’t connecting a read replica, you don’t have to define this parameter.

In the table below are the names, required values, and descriptions of the server settings you must define.
| Setting | Value | Description |
| binlog_format | ROW | Note: This setting is available on Amazon MySQL RDS databases running version 5.6.2 or greater. Defines the binary logging format. A Stitch supports the following event types:
|
| binlog_row_image | FULL | Note: This setting is available on Amazon MySQL RDS databases running version 5.6.2 or greater. Defines how row images are written to the binary log. A |
| log_slave_updates | 1 | Indicates whether updates received by a read replica from a master server should be logged to the replica’s own binary log. Note: This is applicable when using a read replica. |
The backup retention period setting defines the number of days for which automated backups are retained. This ensures that data can still be replicated even if a job is interrupted, there’s database or Stitch downtime, etc.
Navigate back to theDatabases page by using the menu on the left side of the page.
On theModify DB Instance page, scroll down to theBackup section.
SetBackup retention period to anything greater than1 day:

Navigate to the Instance Details page and locate theParameter group. Initially, the Parameter group should sayapplying.
When it changes topending-reboot, you can reboot the database and apply the changes.
Rebooting the instance will take a few minutes. When the status of theparameter group changes toin-sync and theDB instance status (located at the top of the Instance Details page) changes toavailable, the reboot will be complete:

Required for Log-based Replication
This step is required if using Log-based Replication and any of the following are true:
When Stitch connects to your database and uses Log-based Replication, a unique server ID will be required. This ID ensures that the integration - or integrations, if you’re connecting multiple databases - will not encounter conflicts during the replication process.
To avoid conflicts, you’ll check which server IDs are currently in use and then define a new, unqiue ID in Stitch.
Run the following statement:
mysql>SHOWSLAVEHOSTS;TheSHOW SLAVE HOSTS statement will return information about servers that are or have been connected as replication slaves:
+------------+-------------+------+-----------+------------+|Server_id|Host|Port|Master_id|Slave_UUID|+------------+-------------+------+-----------+------------+|192168010|stitch_prod|3306|192168011|<UUID>||1921680101|stitch_dev|3306|192168011|<UUID>|+------------+-------------+------+-----------+------------+When you complete the setup in Stitch, you’ll define a unique server ID for your Stitch Amazon MySQL RDS integration to use.
In addition to thebackup retention period, you also need to define thebinlog retention hours setting. This parameter specifies the number of hours to the database server should retain binary logs.
To specify the number of hours, use themysql.rds_set_configuration procedure when logged into the Amazon MySQL RDS master instance.
In this example, the logs will be retained for seven days (24 x 7 = 168):
call mysql.rds_set_configuration('binlog retention hours', 168);Stitch recommends a minimum of three days for the retention period, but strongly recommend seven.Note: The maximumbinlog retention hours value for Amazon MySQL RDS databases is 168 hours (seven days).
CREATE USER andGRANT OPTION privileges to complete this step.Next, you’ll create a dedicated database user for Stitch. This will ensure Stitch is visible in any logs or audits, and allow you to maintain your privilege hierarchy.
CREATE USER andGRANT OPTION privileges.Run the following command to create the Stitch database user:
CREATEUSER'<stitch_username>'@'localhost'IDENTIFIEDBY'<password>';Replace<password> with a secure password. If using SSH, this can be different than the SSH password.
Grant the Stitch userSELECT privileges by running this command for every table you want to replicate:
GRANTSELECTON'<database_name>'.'<table_name>'to'<stitch_username>'@'localhost';Limiting access to only the tables you want to replicate ensures that the integration can complete discovery (a structure sync) in a timely manner. If you encounter issues in Stitch where tables aren’t displaying, try limiting the Stitch database user’s table access.
Note: Column-level permissions are not supported for use with Log-based Incremental Replication. Restricting access to columns will cause replication issues.
Important:Using Log-based Incremental Replication
Additionally, if you want to use Log-based Incremental Replication, you’ll also need to grant the Stitch user replication privileges:
GRANTREPLICATIONCLIENT,REPLICATIONSLAVEON*.*TO'<stitch_username>'@'localhost';See thePrivileges list tab for an explanation of why these permissions are required by Stitch.
In the table below are the database user privileges Stitch requires to connect to and replicate data from a Amazon MySQL RDS database.
| Privilege name | Reason for requirement |
| SELECT | Required to select rows from tables in a database. |
| REPLICATION CLIENT | Required for binlog replication. Required to use |
| REPLICATION SLAVE | Required for binlog replication. Required to use |
In this step, you’ll complete the setup by entering the database’s connection details and defining replication settings in Stitch.
On theRDS Dashboard page, click theDatabases option on the left side of the page. This will open theRDS Databases page.
In the list of databases, locate and click on the instance you want to connect to Stitch. This will open theDatabase Details page.
On the Database Details page, scroll down to theConnectivity & security section.
DB Name: This field contains the name of the database used to launch the instance. You’ll only need this info if you want to connect this specific database to Stitch.
You can connect this database to Stitch, or another database within Amazon MySQL RDS.
Leave this page open for now - you’ll need it to complete the setup.
On the Stitch Dashboard page, click theAdd Integration button.
Fill in the fields as follows:
Integration Name: Enter a name for the integration. This is the name that will display on the Stitch Dashboard for the integration; it’ll also be used to create the schema in your destination.
For example, the name “Stitch Amazon MySQL RDS” would create a schema calledstitch_amazon_mysql_rds in the destination.Note: The schema name cannot be changed after the integration is saved.
Integration Name: Paste theEndpoint address from the Amazon MySQL RDS Details page in AWS into this field. Don’t include the port number, if it’s appended to the end of the endpoint string - this will cause errors.
Port: Enter the port used by the Amazon MySQL RDS instance. The default is3306.
Username: Enter the Stitch Amazon MySQL RDS database user’s username.
Password: Enter the password for the Stitch Amazon MySQL RDS database user.
If you’re using an SSH tunnel to connect your Amazon MySQL RDS database to Stitch, you’ll also need to define the SSH settings. Refer to theSetting up an SSH Tunnel for a database in Amazon Web Services guide for assistance with completing these fields.
Click theSSH Tunnel checkbox.
Fill in the fields as follows:
SSH Host: Paste thePublic DNS of the SSH sever (EC2 instance) into this field. Refer to theAmazon SSH guide for instructions on retrieving this info.
SSH Port: Enter the SSH port of the SSH server (EC2 instance) into this field. This will usually be22.
SSH User: Enter the Stitch Linux (SSH) user’s username.
Check theConnect using SSL checkbox.Note: The database must support and allow SSL connections for this setting to work correctly.
Fill in the fields as follows:
SSL Certificate: The certificate (typically a CA or server certificate) Stitch should verify the SSL connection against. The connection will succeed only if the server’s certifcate verifies against the certificate provided.
Note: Providing a certifcate via this property isn’t required to use SSL. This is only if Stitch should verify the connection against a specific certificate.
Use an SSL client key: If SSL client authentication should be used, check this box. This will display theClient Certificate andClient Key fields, which are both required when using client authentication.
Client Certificate: If using SSL client authentication, paste the client certificate Stitch should use into this field.Note: You must also provide aClient Key for the connection to be successful.
Client Key: If using SSL client authentication, paste the client key Stitch should use into this field.Note: You must also provide aClient Certificate for the connection to be successful.
In theLog-based Replication section, you can set this as the integration’s default Replication Method.
When enabled, tables that are set to replicate will useLog-based Incremental Replication by default. If you don’t want a table to use Log-based Incremental Replication, you can change it in the Table Settings page for that table.
If this setting isn’t enabled, you’ll have to select aReplication Method for each table you set to replicate.
In theReplication Frequency section, you’ll create the integration’sreplication schedule. An integration’s replication schedule determines how often Stitch runs a replication job, and the time that job begins.
Amazon MySQL RDS integrations support the following replication scheduling methods:
Advanced Scheduling using Cron (Advanced or Premium plans only)
To keep your row usage low, consider setting the integration to replicate less frequently. See theUnderstanding and Reducing Your Row Usage guide for tips on reducing your usage.
When finished, clickCheck and Save.
Stitch will perform a connection test to the Amazon MySQL RDS database; if successful, aSuccess! message will display at the top of the screen.Note: This test may take a few minutes to complete.
The last step is to select the tables and columns you want to replicate.
Note: If a replication job is currently in progress, new selections won’t be used until the next job starts.
For Amazon MySQL RDS integrations, you can select:
Individual tables and columns
All tables and columns (except views)
Database views
Click the tabs to view instructions for each selection method.
Click thecheckbox next to the table’s name. A blue checkmark means the table is set to replicate.
After you set a table to replicate, a page with the table’s columns will display. De-select columns if needed.
Define the table’sReplication Method, or skip this step if you want to use theintegration’s default method.
If usingKey-based Incremental Replication, select aReplication Key.
When finished, clickUpdate Settings.
Repeat this process for every table you want to replicate.
Important: Before using this feature, note that:
Using theSelect All feature will overwrite any previous selections. However, selections aren’t final untilFinalize Your Selections is clicked. ClickingCancel will restore your previous selections.
Log-based Incremental Replication must be enabled and set as the default Replication Method to use the Select All feature.
Refer to theSelect All guide for more info about this feature.
Click theTables to Replicate tab.
Navigate to the table level, selecting any databases and/or schemas that contain tables you want to replicate.
In the menu that displays, clickTrack AllTables and Fields (Except Views):

Setting a database view to replicate is similar to selecting a table, with a few differences. Refer to theReplicating Database Views guide for detailed instructions.
At a high level, you’ll need to complete the following to select a database view:
After you finish setting up Amazon MySQL RDS, itsSync Status may show asPending on either the Stitch Dashboard or in the Integration Details page.
For a new integration, aPending status indicates that Stitch is in the process of scheduling the initial replication job for the integration.This may take some time to complete.
Initial replication jobs with Anchor Scheduling
If using Anchor Scheduling, an initial replication job may not kick off immediately. This depends on the selected Replication Frequency and Anchor Time. Refer to theAnchor Scheduling documentation for more information.
The first seven days of replication, beginning when data is first replicated, are free. Rows replicated from the new integration during this time won’t count towards your quota. Stitch offers this as a way of testing new integrations, measuring usage, and ensuring historical data volumes don’t quickly consume your quota.
| Related | Troubleshooting |
Did this article help? If you have questions or feedback, feel free tosubmit a pull request with your suggestions,open an issue on GitHub, orreach out to us.