DBLab Engine (DLE) is used to boost software development and testing processes by enabling ultra-fast provisioning of databases of any size. In this tutorial, we will install DBLab Engine from the AWS Marketplace. If you are an AWS user, this is the fastest way to have powerful database branching for any database, including RDS and RDS Aurora. But not only RDS: any Postgres and Postgres-compatible database can be a source for DLE.
info
Currently, the AWS Marketplace version of DLE focuses on the "logical" data provisioning mode (dump/restore) – the only possible method for managed PostgreSQL cloud services such as RDS Postgres, RDS Aurora Postgres, Azure Postgres, or Heroku. "Physical" mode (obtaining databases at the file level) is also supported in DLE but requires additional efforts – namely, editing the DLE configuration file manually. More information about various data retrieval options can be found here.
Compared to traditional RDS clones, Database Lab clones are instant. RDS cloning takes several minutes, and, depending on the database size, additional dozens of minutes or even hours may be needed to "warm up" the database (see "Lazy load"). Obtaining a new DLE clone takes as low as a few seconds, and it does not increase storage and instance bill at all.
A single DLE instance can be used by dozens of engineers or CI/CD pipelines – all of them can work with dozens of thin clones located on a single instance and single storage volume. RDS Aurora clones are also "thin" by nature, which could be great for development and testing. However, each Aurora clone requires a provisioned instance, increasing the "compute" part of the bill; IO-related charges can be significant as well. This makes Aurora clones less attractive for the use in non-production environments. The use of DLE clones doesn't affect the bill anyhow – both "compute" and "storage" costs remain constant regardles of the number clones provisioned at any time.
Typical "pilot" setupTimeline:
Outcome:
shared_buffers = 1GB
for each clone): ~30 clonesFirst steps to install DLE from the AWS Marketplace are trivial:
And press the "View purchase options" button:
Then, press "Subscribe":
Next, press "Launch your software":
Now, check that the DBLab Engine version (the latest is recommended) and the AWS regions are chosen correctly, then press "Continue to Launch":
On this page you need to choose "Launch CloudFormation" and press "Launch":
This page should be left unmodified, just press the "Next" button:
Now, it is time to fill the form that defines the AWS resources that we need:
0.0.0.0/0
; for production use, restrict connections wisely);Next, on the same page:
Then, press "Next".
At the bottom of the next page acknowledge that AWS CloudFormation might create IAM resources. Then, press the "Next" button:
Once you've pressed "Submit", the process begins.
You need to wait a few minutes while all resources are being provisioned. Check out the "Outputs" section periodically. Once DLE API and UI are ready, you should see the ordered list of instructions on how to connect to UI and API.
Step 2. Configure and launch the DBLab EngineEnter the verification token, you have created earlier. You can also find it in the "Outputs" section.
Now it's time to define DB credentials of the source to initiate database provisioning – this is how DLE will be initialized, performing the very first data retrieval, and then the same parameters will be used for scheduled full refreshes according to the schedule defined. Fill the forms, and use the information in the tooltips if needed.
Then press "Test connection". If your database is ready for dump and restore, save the form and press "Switch to Overview" to track the process of data retrieval.
In the Overview tab, you can see the status of the data retrieval. Note that the initial data retrieval takes some time – it depends on the source database size. However, DLE API, CLI, and UI are already available for use. To observe the current activity on both source and target sides use "Show details".
Once the retrieval is done, you can create your first clone. Happy cloning!
Video demonstration of steps 1 and 2 Need to start over? Here is howIf data provisioning fails, you can always:
If something went south in general and you need a fresh start, go back to AWS CloudFormation and delete your stack; then start from the very beginning of this tutorial
Getting supportWith DLE installed from AWS Marketplace, the guaranteed vendor support is included – please use one of the available ways to contact.
TroubleshootingTo troubleshoot:
sudo docker ps
(to see all containers including the stopped ones: sudo docker ps -a
)sudo docker logs -f dblab_server
(the same logs you can observe in UI – the "Logs" tab)/var/lib/dblab/dblab_pool/dataset_1/data/log
for the first snapshot of the database, in /var/lib/dblab/dblab_pool/dataset_2/data/log
for the second one (if it's already fetched); if you have configured DLE to have more than 2 snapshots, check out the other directories too (/var/lib/dblab/dblab_pool/dataset_$N/data/log
, where $N
is the snapshot number, starting with 1
)With DBLab, you can create safe, instant copies of your database: perfect for testing, experimenting, or trying out new ideas. In this step, you'll learn how to:
psql
installed on your working machine. In the terminal, type psql
and paste the psql connection string field contents. Change the database name DBNAME
parameter, you can always use postgres
for the initial connection.\d
.dblab
)
CLI can be used on any machine, you just need to be able to reach the DLE API (port 2345 by default). In this tutorial, we will install and use CLI locally on the EC2 instance.
curl -fsSL https://gitlab.com/postgres-ai/database-lab/-/raw/master/engine/scripts/cli_install.sh | bash
sudo mv ~/.dblab/dblab /usr/local/bin/dblab
Initialize CLI configuration (assuming that localhost:2345
forwards to DLE machine's port 2345):
dblab init \
--environment-id=tutorial \
--url=http://localhost:2345 \
--token=secret_token \
--insecure
Check the configuration by fetching the status of the instance:
Create a clonedblab clone create \
--username dblab_user_1 \
--password secret_password \
--id my_first_clone
After a second or two, if everything is configured correctly, you will see that the clone is ready to be used. It should look like this:
{
"id": "botcmi54uvgmo17htcl0",
"snapshot": {
"id": "dblab_pool@initdb",
"createdAt": "2020-02-04T23:20:04Z",
"dataStateAt": "2020-02-04T23:20:04Z"
},
"protected": false,
"deleteAt": "",
"createdAt": "2020-02-05T14:03:52Z",
"status": {
"code": "OK",
"message": "Clone is ready to accept Postgres connections."
},
"db": {
"connStr": "host=111.222.000.123 port=6000 user=dblab_user_1",
"host": "111.222.000.123",
"port": "6000",
"username": "dblab_user_1",
"password": ""
},
"metadata": {
"cloneDiffSize": 479232,
"cloningTime": 2.892935211,
"maxIdleMinutes": 0
},
"project": ""
}
Connect to a clone
You can work with the clone you created earlier using any PostgreSQL client, for example, psql
. To install psql
:
sudo apt-get install postgresql-client
Use connection info (the db
section of the response of the dblab clone create
command):
PGPASSWORD=secret_password psql \
"host=localhost port=6000 user=dblab_user_1 dbname=test"
Check the available tables:
Now let's see how quickly we can reset the state of the clone. Delete some data or drop a table. Do any damage you want! And then use the clone reset
command (replace my_first_clone
with the ID of your clone if you changed it). You can do it not leaving psql
– for that, use the \!
command:
\! dblab clone reset my_first_clone
Check the status of the clone:
\! dblab clone status my_first_clone
Notice how fast the resetting was, just a few seconds! 💥
Reconnect to the clone:
Now check the database objects you've dropped or partially deleted – the "damage" has gone.
Create a snapshot UILet's capture a snapshot of your database state. Think of it as creating a checkpoint - you can always return to this exact moment later, or use it as a starting point for new experiments.
You can now use the snapshot as a base for new clones or branches!
Pro tip: You can also create snapshots directly from any clone page - just look for the "Create snapshot" button there.
CLIYou can also create snapshots from the CLI:
dblab commit --clone-id my_first_clone --message "Snapshot message"
Replace my_first_clone
with your actual clone ID (you can see it in the UI or use dblab clone list
to list them).
Want to see all your snapshots? Just run:
Instant database branching UINow let's create your first branch! Branches let you experiment safely, try new features, or test fixes in your own isolated environment.
main
by default) and/or a snapshot that will be memorized as a forking point.Your new branch is now ready for you to use!
CLITo create a branch from the command line:
dblab branch my_first_branch
By default, the current branch will be the parent of the new one. You can see your current branch, along with other existing branches, using this command:
Additionally, you can specify the parent branch or the snapshot when creating a branch:
dblab branch --parent-branch my_first_branch my_second_branch
dblab branch --snapshot-id SNAPSHOT_ID my_first_branch
Try it out: Create a branch, make some changes to your data, then create another branch from that point. You're now branching like a pro! 🌿
For more, see the full client CLI reference.
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4