Infrastructure at your Service

Category

AWS

Daniel Westermann

Getting started with Exasol – A multi node cluster in AWS

By | AWS, Database Administration & Monitoring | No Comments

As of now, all the previous posts worked with the Community Edition of Exasol. While that is great for getting started quickly (the Community Edition comes with all the features, except multi node support and a limit of 200GB of data) there are some limitations: Can you not really test how data is distributed across cluster nodes because there is only one. You do not have a real access to the operating system as the…

Read More
Franck Pachot

Amazon Aurora Serverless (PostgreSQL compatibility)

By | AWS, Cloud | No Comments

By Franck Pachot . I’ve written a blog post about serverless databases and here is an example of Amazon RDS Aurora PostgreSQL in serverless mode: When I’ve created the instance (15:55 – CloudWatch is GMT+2 but event log is UTC), it started with 0 capacity unit (18:03), which means that it was paused (you pay for storage only). Then I connected and the instance was resumed (18:19) to its maximum capacity unit (4 here). And…

Read More
Franck Pachot

Amazon or AWS services?

By | AWS, Cloud | No Comments

By Franck Pachot . When I’m writing about a product I like to be precise about the name, the upper and lower case, and even more: do you know that was taking special care of writing Oracle 12cR2 before then non-italic came with 18c? And that’s also the reason I’m not writing a lot about VMware as it takes me 5 minutes to put the uppercase right 😀 You may have seen that many Amazon…

Read More
Franck Pachot

What is Object Storage?

By | AWS, Cloud, Oracle | No Comments

By Franck Pachot . I’ve always been working with databases. Before the cloud era, the most abstract term was “data”. A variable in memory is data. A file is data. A block of disk contains data. We often created a ‘/data’ directory to put everything that is not binaries and configuration files. I’ll always remember when I did that while working in Dakar. My colleagues were laughing for minutes – my Senegalese followers will understand…

Read More
Franck Pachot

Amazon DynamoDB: a r(el)ational Glossary

By | AWS, Cloud | No Comments

By Franck Pachot . There are many NoSQL databases. And, because SQL is an ISO standard, “No SQL” also means “No Standard”. Many have a similar API and similar objects, but with completely different names. Today, NoSQL databases are used as an additional datastore for some well-defined use cases for which a hashed key-value store fits better than a relational table. And it quickly became “Not Only SQL” as it is complementary to RDBMS databases…

Read More
Franck Pachot

Amazon DynamoDB: the cost of indexes

By | AWS, NoSQL | No Comments

By Franck Pachot . That’s common to any data structure, whether it is RDBMS or NoSQL, indexes are good to accelerate reads but slow the writes. This post explains the consequences of adding indexes in DynamoDB. Secondary Indexes What we call an index in DynamoDB is different from an index in RDBMS. They have the same goal: store your data with some redundancy in order to have it physically partitioned, sorted, and clustered differently than…

Read More
Franck Pachot

Amazon DynamoDB Local: running NoSQL on SQLite

By | AWS | No Comments

By Franck Pachot . DynamoDB is a cloud-native, managed, key-value proprietary database designed by AWS to handle massive throughput for large volume and high concurrency with a simple API. simple API: Get, Put, Query, Scan on a table without joins, optimizer, transparent indexes,… high concurrency: queries are directed to one shard with a hash function massive throughput: you can just add partitions to increase the IOPS large volume: it is a shared-nothing architecture where all…

Read More

RDBMS (vs. NoSQL) scales the algorithm before the hardware

By | AWS, NoSQL | No Comments

By Franck Pachot . In The myth of NoSQL (vs. RDBMS) “joins dont scale” I explained that joins actually scale very well with an O(logN) on the input tables size, thanks to B*Tree index access, and can even be bounded by hash partitioning with local index, like in DynamoDB single-table design. Jonathan Lewis added a comment that, given the name of the tables (USERS and ORDERS). we should expect an increasing number of rows returned…

Read More
Franck Pachot

DBPod – le podcast Bases de Données

By | AWS, Cloud, Database Administration & Monitoring, Database management, Development & Performance, DevOps, NoSQL, Oracle, Postgres, SQL Server | No Comments

By Franck Pachot . J’essaie quelque chose de nouveau. Je publie beaucoup en anglais (blog, articles, présentations) mais cette fois quelque chose de 100% francophone. En sortant du confinement, on reprend les transports (train, voiture,…) et c’est l’occasion de se détendre en musique mais aussi de s’informer avec des podcasts. J’ai l’impression que c’est un format qui a de l’avenir: moins contraignant que regarder une video ou ou lire un article ou une newsletter. Alors…

Read More
Franck Pachot

The myth of NoSQL (vs. RDBMS) “joins dont scale”

By | AWS, Database Administration & Monitoring, NoSQL, Oracle, Postgres, SQL Server | 15 Comments

By Franck Pachot . I’ll reference Alex DeBrie article “SQL, NoSQL, and Scale: How DynamoDB scales where relational databases don’t“, especially the paragraph about “Why relational databases don’t scale”. But I want to make clear that my post here is not against this article, but against a very common myth that even precedes NoSQL databases. Actually, I’m taking this article as reference because the author, in his website and book, has really good points about…

Read More