Cassandra Data Types: Built-in, Collection, & User-defined

June 10, 2020

Introduction

Apache Cassandra uses CQL (Cassandra Query Language) for communicating with its database. Cassandra is similar to SQL as it also stores data in tables, organizing it into rows and columns.

Cassandra stores data in variables. Each variable has an assigned data type that defines the type (or range) of the values it can store, and what operations it can perform without causing an error.

Read on to learn about Cassandra data types and how they differ.

tutorial on data types in cassandra

Cassandra Data Types

Apache Cassandra supports a rich set of data types, including:

  1. Built-in data types
  2. Collection data types
  3. User-defined data types

Note: Apache Cassandra is a Wide-Column NoSQL database. If you want to learn more about these types of databases, read NoSQL database types. And if you are interested in NoSQL core concepts and features, refer to What is NoSQL.


Built-In Data Types

Cassandra has many data types for which it provides built-in support. These are also referred to as primitive data types. They come pre-defined and you can directly refer to any of them.

Data TypeConstantsDescription
asciistringsASCII is a data type that includes character encoding used for strings. In it, numeric code represents characters (for instance, T is 84). While the standard ASCII can depict 128 characters, the extended version incorporates 256 characters.
BooleanbooleansBOOLEAN is used for variables that have one of two possible values. These values are stored as 16-bit numbers, but they can only be True or False.
blobblobsBLOB is short for “Binary Large Object” and its utilized for storing binary data. As it represents arbitrary bytes, it is mainly used to store images, videos, and audio files. Because of their size, they require more space compared to other data types.
decimalintegers, floatsDECIMAL data types are convenient for storing currency data due to the precision it offers. It is used for numeric values that consist of two components:   precision (number of digits: 5.754) scale (digits that come after the decimal point: 5.754) It stores the value 5.754 as two separate units: 5 (precision) and 754 (scale).
doubleintegersIf you need to store decimal values that do not require the level of precision of currency values, you can use the DOUBLE data type. It represents a 64-bit floating point and is used for integers.
floatintegers, floatsThe FLOAT data type stores decimal point values. It is a single-precision, representing a 32-bit floating point. You shouldn’t use it with data that requires high accuracy since it is not as precise as decimal data type representation.
intintegersINT data type is used to store 32-bit signed integers.
smallintintegersSMALLINT stores 16-bit signed integers.
bigintintegersBIGINT stores 64-bit signed integers.
textstringsTo store data you can use TEXT data types used for text data, represented in UTF8 encoded strings.
varcharstringsUse VARCHAR for variables or arbitrary characters. It stores in UTF8 encoded strings for which you can also determine the maximum size.
inetstringsTo store strings of characters that don’t require any arithmetic operations utilize the INET data type. Use it to save and manage IP addresses since it supports both numeric and character representation. INET can store IPv4 and IPv6 host addresses.
counterintegersThe COUNTER data type is used for 64-bit integral values and stores them in counter columns. This data type supports two operations: incrementing and decrementing, and is commonly used to count page views.
timeintegers, stringsYou can store time values in the following format: hh:mm:ss using the time data type. It offers nanosecond precision and supports data in integers and strings.
dateintegers, stringsAccordingly, you can store date values in the format: YYYY-MM-DD. This data type also supports integers and strings.
timestampintegers, stringsThe TIMESTAMP data type is a combination of the previously mentioned two. It is used for values that include time and date values in the format: YYYY-MM-DD hh:mm:ss

Note: To learn more about Cassandra, see Cassandra vs MongoDB.


Collection Data Types

You can use one of the collection data types if you want to store multiple values into one unit.

Cassandra supports three kinds of collection data types:

  1. Maps. Cassandra can store data in sets of key-value pairs using the Map data type. It allows you to store data and assign labels (key names) to it for easier sorting.
  2. Sets. You can store multiple unique values, using the Set data type. Bear in mind that the elements will not be stored in order.
  3. Lists. If you need to store multiple values in a specific order, you can use the List data type. Unlike sets, lists can store duplicate values.
Cassandra collection data types.

User-Defined Data Type

The last type of Cassandra data types are User-Defined data types (UDTs). As the name suggests, it allows you (the user) to create your own data type based on the requirements you need.

A UDT consists of multiple data fields of any data type inside a single column. Once you create your user-defined data type, you can change or even remove the fields inside of it.

Conclusion

Now you should have an understanding of the primary groups of data types used in Apache Cassandra: Built-in, collection, and user-defined. This NoSQL database offers a broad scope of data types to utilize while storing and managing your data.

If you would like to test this out, check out our article on how to install Cassandra on Ubuntu or how to install Cassandra on your Windows machine.

Was this article helpful?
YesNo
Sofija Simic
Sofija Simic is an experienced Technical Writer. Alongside her educational background in teaching and writing, she has had a lifelong passion for information technology. She is committed to unscrambling confusing IT concepts and streamlining intricate software installations.
Next you should read
PostgreSQL Vs MySQL: A Detailed Comparison
March 30, 2023

Explore the differences between the two most widely used database management systems...
Read more
How to Create MySQL Database in Workbench
January 29, 2020

Workbench is a cross-platform, open-source, visual tool for database management. It provides a user-friendly...
Read more
Cassandra vs MongoDB - What are the Differences?
April 27, 2020

Learn about the difference between Cassandra and MongoDB. These NoSQL databases have some similarities, but...
Read more
How to Install Cassandra on Windows 10
June 8, 2020

This in-depth tutorial explains how to install Cassandra on Windows 10. The abundance of images in this...
Read more