Links

SQLake cost estimate

Upsolver SQLake provides a transparent and simple to understand value-based pricing model. To estimate the SQLake costs you simply need to understand the amount of data you will ingest.
Upsolver offers 3 editions - developer sandbox, on-demand, and annual commitment. To learn more visit the SQLake pricing page.
When you deploy SQLake in your AWS VPC, Upsolver will automatically configure the required infrastructure. This infrastructure gives you full control over your data and ensures that none of it leaves your AWS account. You can learn more about these components on the Deploying to AWS page. Upsolver does not charge you for infrastructure resources, they are billed directly by AWS. However, Upsolver will attempt to optimize how it uses these resources to reduce operational costs, like leveraging EC2 Spot instances when applicable.

SQLake Costs

  • SQLake's base usage fee is $99/TB.
  • If you become an annual customer, SQLake has a platform fee that covers 24X7 support and unlocks additional features.

Cost estimates for the AWS resources used to run SQLake

Prices are based on the US East region. AWS users with custom pricing/discounts will use their rates. Last updated Feb 2023.
Predictable infrastructure costs:
  • One Kinesis shard, in a provisioned mode, for about $15 a month regardless of deployment size. Kinesis is used for synchronization within SQLake. See the AWS Kinesis pricing page.
  • One r6i.xlarge for the API server, using spot instances, for about $60, depending on spot availability. The API server runs the user interface and handles all CRUD operations (e.g. create a job) and settings management via the user interface or CLI. See the AWS EC2 Spot Instances pricing page.
Variable costs:
  • In order to process up to 10TB of data in a month, you will likely need 1 server, of size 2xlarge (e.g. r6i.2xlarge). Current spot pricing is about $75 per month. By default, Upsolver uses Spot instances, which results in a 60%-80% reduction in compute costs over on-demand instances. The cluster auto-scaling feature allows you to only pay for what you use. See the AWS EC2 Spot Instances pricing page.
  • $23 a month per 1TB stored on the data lake. The size of your data lake will depend on many factors, the primary of which is how long data is retained. You can also turn on retention for tables to delete data you no longer wish to pay for. See the AWS S3 pricing page.
  • SQLake manages table metadata in the AWS Glue Data Catalog which bills based on the amount of metadata stored and the number of API requests. This often represents a small part of the total cost associated with data ingestion and transformation. See the AWS Glue Pricing page.

Total cost estimates per use case

Ingesting 3TB of data from S3 over a month
  • SQLake usage fee - 3 * $99 = $297 SQLake usage fee.
  • Data lake - 3 * $23 = $69 for S3 storage of newly ingested data and some data retained from previous months.
  • Computing - $75 for one 2xlarge server.
  • Infrastructure (Kinesis and API server) - $75
A total estimate of $516 = $297 paid to Upsolver and $219 paid to AWS.
Ingesting 10TB of data from your Kinesis over a month
  • SQLake usage fee - 10 * $99 = $990.
  • Data lake - 10 * $23 = $230 for S3 storage of newly ingested data and some data retained from previous months.
  • Computing - $75 for one 2xlarge server.
  • Infrastructure (Kinesis usage by SQLake and API server) - $75.
A total estimate of $1370 = $990 paid to Upsolver and $380 paid to AWS.
Ingesting 100TB of data from S3 over a month
  • SQLake usage fee -100 * $99 = $9900.
  • Data lake - 100 * $23 = $2300 for S3 storage of newly ingested data and some data retained from previous months.
  • Computing - 10 * $75 = $750 for 2xlarge servers. Each serves 10TB of data.
  • Infrastructure (Kinesis and API server) - $75
A total estimate of $13025 = $9900 paid to Upsolver and $3125 paid to AWS