Bring Diverse Data Sources Together with OCI Data Integration and HeatWave

Virtually all data types from any data source can be integrated into MySQL Database Service with Heatwave by using Oracle Cloud Infrastructure (OCI) Data Integration.

HeatWave is an in-memory query accelerator developed for the MySQL Database Service, and is available only in Oracle Cloud Infrastructure. MySQL Database Service with HeatWave enables you to run online transactional processing (OLTP) and online analytical processing (OLAP) workloads directly from a MySQL database without any changes to the applications. This eliminates the need for a separate analytics database.

OCI Data Integration is a cloud native, fully managed server-less ETL service for extracting, transforming, and loading data into data warehouses.

Architecture

This architecture uses Oracle Cloud Infrastructure (OCI) MySQL Database Service with Heatwave and OCI Data Integration. It shows how to use the same database instance for OLTP and OLAP workloads simultaneously.

In this architecture data from an on-premises MySQL database is transferred and loaded into MySQL Database Service on OCI.

The following diagram illustrates this reference architecture.

Description of architecture-integrate-heatwave.png follows
Description of the illustration architecture-integrate-heatwave.png

architecture-integrate-heatwave-oracle.zip

The architecture has the following components:

  • Region

    An Oracle Cloud Infrastructure region is a localized geographic area that contains one or more data centers, called availability domains. Regions are independent of other regions, and vast distances can separate them (across countries or even continents).

  • Availability domains

    Availability domains are standalone, independent data centers within a region. The physical resources in each availability domain are isolated from the resources in the other availability domains, which provides fault tolerance. Availability domains don’t share infrastructure such as power or cooling, or the internal availability domain network. So, a failure at one availability domain is unlikely to affect the other availability domains in the region.

  • Virtual cloud network (VCN) and subnets

    A VCN is a customizable, software-defined network that you set up in an Oracle Cloud Infrastructure region. Like traditional data center networks, VCNs give you complete control over your network environment. A VCN can have multiple non-overlapping CIDR blocks that you can change after you create the VCN. You can segment a VCN into subnets, which can be scoped to a region or to an availability domain. Each subnet consists of a contiguous range of addresses that don't overlap with the other subnets in the VCN. You can change the size of a subnet after creation. A subnet can be public or private.

  • Security list

    For each subnet, you can create security rules that specify the source, destination, and type of traffic that must be allowed in and out of the subnet.

  • Route table

    Virtual route tables contain rules to route traffic from subnets to destinations outside a VCN, typically through gateways.

  • Internet gateway

    The internet gateway allows traffic between the public subnets in a VCN and the public internet.

  • Dynamic routing gateway (DRG)

    The DRG is a virtual router that provides a path for private network traffic between a VCN and a network outside the region, such as a VCN in another Oracle Cloud Infrastructure region, an on-premises network, or a network in another cloud provider.

  • FastConnect

    Oracle Cloud Infrastructure FastConnect provides an easy way to create a dedicated, private connection between your data center and Oracle Cloud Infrastructure. FastConnect provides higher-bandwidth options and a more reliable networking experience when compared with internet-based connections.

  • VPN Connect

    VPN Connect provides site-to-site IPSec VPN connectivity between your on-premises network and VCNs in Oracle Cloud Infrastructure. The IPSec protocol suite encrypts IP traffic before the packets are transferred from the source to the destination and decrypts the traffic when it arrives.

  • Identity and Access Management (IAM)

    Use OCI Identity and Access Management policies to control who can access your cloud resources and what operations can be performed.

  • MySQL Database Service

    Oracle MySQL Database Service is a fully-managed database service that lets developers quickly develop and deploy secure, cloud-native applications using the world’s most popular open source database.

  • HeatWave

    HeatWave is a new, integrated, high-performance, in-memory query accelerator for MySQL Database Service that accelerates MySQL performance for analytics and transactional queries.

  • Compute VM

    The OCI Compute service enables you to provision and manage compute hosts in the cloud. You can launch Compute instances with shapes that meet your resource requirements for CPU, memory, network bandwidth, and storage. After creating a Compute instance, you can access it securely, restart it, attach and detach volumes, and delete it when you don't need it.

  • Oracle Object storage

    Object storage provides quick access to large amounts of structured and unstructured data of any content type, including database backups, analytic data, and rich content such as images and videos. Use standard storage for "hot" storage that you need to access quickly, immediately, and frequently. Use archive storage for "cold" storage that you retain for long periods of time and seldom or rarely access.

  • Data Integration Service

    Virtually all data types from any data source can be integrated into the MySQL Database Service and HeatWave using OCI Data Integration. When you need to collect data from diverse sources in a timely manner, Oracle Cloud Infrastructure (OCI) Data Integration, a cloud native, fully managed server-less ETL (extract, load, transform) service on Oracle Cloud Infrastructure , for extracting, loading, transforming data into data warehouses.

Recommendations

Your requirements might differ from the architecture described here. Use the following recommendations as a starting point.

  • VCN

    When you create a VCN, determine the number of CIDR blocks required and the size of each block based on the number of resources that you plan to attach to subnets in the VCN. Use CIDR blocks that are within the standard private IP address space.

    Select CIDR blocks that don't overlap with any other network (in Oracle Cloud Infrastructure, your on-premises data center, or another cloud provider) to which you intend to set up private connections.

    After you create a VCN, you can change, add, and remove its CIDR blocks.

    When you design the subnets, consider your traffic flow and security requirements. Attach all the resources within a specific tier or role to the same subnet, which can serve as a security boundary.

    Use regional subnets.

  • Object Storage

    This architecture uses standard Oracle Cloud Infrastructure Object Storage to store processed output so that other cloud services can access the output for further analysis and display.

Considerations

Consider the following points when deploying this reference architecture.

  • Resource Limits

    Consider the best practices, limits by service, and compartment quotas for your tenancy.

  • MySQL Database Service

    Create the MySQL service using a MySQL.HeatWave.VM.Standard.E3 or MySQL.HeatWave.BM.Standard.E3 shape.

  • Connectivity

    Consider using FastConnect if you want a dedicated, private connection between your premises and OCI, otherwise use VCN Connect.

More Information

Links to additional information that can help you learn about, modify, use, or implement this architecture.