Cloud9QL - SQL for NoSQL & Data Transformation

Cloud9QL is Knowi's proprietary version of SQL, designed to simplify data transformation and manipulation directly within the platform. By leveraging Cloud9QL, users can effortlessly clean, filter, and manipulate their data in multiple stages of the analysis process without needing to depend on native query languages of the underlying datastore.

Quickstart Guide
Key Differences Between Cloud9QL and Other Versions of SQL
Where Can I Write Cloud9QL?

Cloud9QL Query: select Sent as Sent Messages, Customer where sent > 100000 order by sent messages desc limit 5

Quickstart Guide

Select All Data

select *

Keyword 'select' is optional.

Select Specific Fields

select Sent, Date

Select All Except Specific Fields

Use the ~ symbol before a field name to exclude that field

select *, ~Message_Type, ~Customer_Name

Aliasing Fields

You can rename fields on the fly to make the output more meaningful

select Sent as Sent Messages, Date

Applying Conditions (Filters)

Cloud9QL supports comparison and logical operators such as: =, >, >=, <, <=, !=, like, not like, and, or

select * where opened > 100000
select * where campaign_name like 'artists'
select * where Message_Type = 'Transactional' and Sent > 200000

Sorting Data

Order results easily with asc or desc

select * where opened > 100000 order by opened desc

Limiting Results

select * where opened > 100000 order by opened desc limit 1

Unique Records

select distinct *
select distinct customer

Counting Rows

Use count to see how many rows are in your dataset

select count(*)

Or count the number of rows when a condition is met. For example, how many people in the employees table work in HR?

select count(*) where department = 'HR'

Cloud9QL supports the following aggregation functions: count, sum, avg, distinct, max, min, sd, median

Window Functions

Calculate aggregates for each row without grouping:

select id, category, amount, SUM(amount) OVER (PARTITION BY category) as category_total

This shows each row with its category's total, unlike GROUP BY which collapses rows.

Accessing Nested Values

You can access nested values, including elements within arrays, from non-relational databases like MongoDB or JSON files using dot notation and array indexing. Learn more about working with nested arrays and objects here

select 
    nestedObj.a, nestedArr[0], 
    nestedObj.secondLevel.x as SecondLevelObject, 
    nestedObj.secondLevel.y[1] as SecondLevelArray
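As a rough illustration of what the dot and index paths in the query above resolve to, here is a Python sketch. It is not how Cloud9QL is implemented. The sample record and its values are made up for this example, and the output names used for the two unaliased fields are an assumption; only the field paths and aliases come from the query.

```python
# Hypothetical document shaped the way the query above expects.
record = {
    "nestedObj": {"a": 1, "secondLevel": {"x": "hello", "y": [10, 20, 30]}},
    "nestedArr": ["first", "second"],
}

# Each output column corresponds to one path in the select list.
# The keys for the unaliased columns are an assumption made for this sketch.
row = {
    "nestedObj.a": record["nestedObj"]["a"],
    "nestedArr[0]": record["nestedArr"][0],
    "SecondLevelObject": record["nestedObj"]["secondLevel"]["x"],
    "SecondLevelArray": record["nestedObj"]["secondLevel"]["y"][1],
}
print(row)
```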

Unwinding Nested Arrays

Expanding a Single Nested Field:

select customer, nestedObj.secondLevel.y as Nested;
select expand(Nested);

For multiple arrays or fields, use expand_arrays:

select expand_arrays(nestedArr1, nestedArr2);

For filling in missing values while expanding, use expand_arrays_with_defaults:

select expand_arrays_with_defaults(nestedArr1, null, nestedArr2, 0);

Learn more about Cloud9QL's expand functions here

Multiple Statements

You can run multiple SQL statements in a row to create a more complex, multi-step transformation. Each statement uses the output of the previous one as its starting point, and every statement must end with a semicolon.

select department, count(*) as emp_count, avg(salary) as average_salary
group by department;

select department, average_salary
where emp_count > 5;
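One way to picture the chaining is to treat each statement as a function from rows to rows, where the output of one becomes the input of the next. The Python sketch below is only an illustration under that assumption: the employee rows and the function names `count_by_department` and `keep_large` are hypothetical.

```python
from collections import defaultdict

# Made-up input rows for this sketch.
employees = [
    {"department": "HR", "salary": 50000},
    {"department": "HR", "salary": 60000},
    {"department": "IT", "salary": 80000},
]

def count_by_department(rows):
    """Statement 1: select department, count(*) as emp_count group by department."""
    counts = defaultdict(int)
    for r in rows:
        counts[r["department"]] += 1
    return [{"department": d, "emp_count": n} for d, n in counts.items()]

def keep_large(rows, minimum):
    """Statement 2: select * where emp_count > minimum."""
    return [r for r in rows if r["emp_count"] > minimum]

# Statement 2 sees only the columns that statement 1 produced.
step1 = count_by_department(employees)
step2 = keep_large(step1, 1)
print(step2)
```

In this sketch the salary field is no longer available to the second step, because the first step did not include it in its output.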

Comments

Single-Line Comments

Use -- to comment out a single line of text. Everything following -- on that line is ignored.

select *;  -- This retrieves all rows

Multi-Line or Block Comments

Use /* */ to comment out multiple lines or a block of text. Everything between /* and */ is ignored:

/* This block of code retrieves employee names and departments. 
We are ignoring other fields for now. */ 

select name, department;

Key Differences Between Cloud9QL and Other Versions of SQL

No FROM Statement Required: In Cloud9QL, queries are always associated with a specific dataset, so there's no need to explicitly include a FROM clause.
Case Insensitivity: SQL keywords (e.g., SELECT, WHERE) and field names are not case-sensitive. However, string values in conditions remain case-sensitive.

Example: select * where region = 'North America'
Spaces Allowed in Field Names: You can include spaces in field names without needing to surround them with quotes.

Example: select Messages Sent
Optional SELECT Statement: The SELECT keyword is optional and can be omitted for simplicity.
Subqueries Not Supported: Cloud9QL doesn't support subqueries, but you can write multiple statements by separating them with a semicolon. You can also use the append function to combine different aggregations from the same dataset.

Where Can I Write Cloud9QL?

At the Query Level

You can use Cloud9QL transformations on datasets both before and after joins, directly within the Queries section.

Select the Editor tab.

You can use Cloud9QL to transform data either alongside or instead of the native query language of the underlying datastore. In this example, the Cloud9QL transformations will be applied after the Mongo Query is executed.

After the data is joined, you can apply a Cloud9QL Post Query to your dataset.

At the Widget Level

Once data is presented in a widget, further transformations can be made through the Analyze Tab.

At the bottom of the Analyze tab, select Add Cloud9QL.

Here you can apply your Cloud9QL transformations. Press Preview to see the changes applied to the dataset and Save to apply those changes.

You can also access Cloud9QL Transformations directly from the widget dropdown menu by selecting Cloud9QL.

Cloud9QL is applied as the first transformation step, prior to any analysis. This is only accessible to admins and widget creators.

Aggregations

Aggregation functions summarize the data, optionally grouped by one or more dimensions.

Sent	Message Type	Customer	Week	Date	Opened	Delivered	Clicks	Campaign_name
                    239232	Transactional	Facebook	4/7/14	4/8/14 17:15:00	56248	223796	14563	Trial
                    1151481	Marketing	Target	4/7/14	4/8/14 21:35:00	253532	1073901	66841	More artists, more music
                    196746	Transactional	Overstock	4/7/14	4/8/14 14:10:00	38094	178767	9429	Newsletter
                    178966	Transactional	Wells Fargo	4/7/14	4/8/14 9:10:00	32189	165333	7354	Trial
                    165569	Transactional	Costco	4/7/14	4/8/14 0:30:00	28642	151550	6929	Trial
                    165961	Transactional	Macy's	4/7/14	4/8/14 22:07:00	24908	159609	6291	Account
                    784686	Marketing	LinkedIn	4/7/14	4/8/14 14:39:00	119524	710909	29721	30% off Limited Sale
                    145403	Transactional	eHarmony	4/7/14	4/8/14 5:34:00	19721	127359	5020	Trial
                    463954	Marketing	Netflix	4/7/14	4/8/14 1:56:00	46237	421751	10732	More artists, more music
                    246717	Transactional	Facebook	3/31/14	4/7/14 10:32:00	61009	221458	16006	Order
                    1162344	Marketing	Target	3/31/14	4/6/14 23:40:00	266435	1071309	68886	More artists, more music
                    197036	Transactional	Overstock	3/31/14	4/7/14 16:44:00	37839	176191	9451	Newsletter
                    167315	Transactional	Wells Fargo	3/31/14	4/7/14 16:35:00	27891	161753	6925	Trial
                    156206	Transactional	Costco	3/31/14	4/7/14 16:43:00	26099	144766	6561	Newsletter
                    156174	Transactional	Macy's	3/31/14	4/7/14 14:16:00	23489	142839	6161	Account
                    157939	Transactional	LinkedIn	3/31/14	4/7/14 12:20:00	24732	149870	6545	Trial
                    722770	Marketing	eHarmony	3/31/14	4/7/14 14:57:00	104588	641357	25453	30% off Limited Sale
                    491693	Marketing	Netflix	3/31/14	4/7/14 8:46:00	45035	449170	11304	More artists, more music
                    234122	Transactional	Facebook	3/31/14	4/6/14 12:22:00	59947	216055	13776	Account
                    1145564	Marketing	Target	3/31/14	4/6/14 12:50:00	257945	1002871	64692	More artists, more music
                    178931	Transactional	Overstock	3/31/14	4/6/14 8:17:00	35048	158948	8577	Trial
                    811519	Marketing	Wells Fargo	3/31/14	4/6/14 20:30:00	135691	764260	35570	Rewards
                    808258	Marketing	Costco	3/31/14	4/6/14 7:48:00	135381	709978	32209	30% off Limited Sale
                    157983	Transactional	Macy's	3/31/14	4/6/14 1:54:00	25788	140572	6133	Account
                    795276	Marketing	LinkedIn	3/31/14	4/6/14 12:43:00	125063	746577	29901	Rewards
                    706727	Marketing	eHarmony	3/31/14	4/6/14 19:00:00	103184	646427	26493	More artists, more music
                    458645	Marketing	Netflix	3/31/14	4/6/14 12:17:00	43144	412840	10902	30% off Limited Sale
                    250042	Transactional	Facebook	3/31/14	4/4/14 23:18:00	58727	235809	14730	Account
                    1176682	Marketing	Target	3/31/14	4/5/14 23:10:00	269357	1041828	69955	30% off Limited Sale
                    176811	Transactional	Overstock	3/31/14	4/5/14 23:59:00	32916	154478	8485	Order
                    826650	Marketing	Wells Fargo	3/31/14	4/5/14 0:23:00	140095	742092	35773	Rewards
                    154919	Transactional	Costco	3/31/14	4/5/14 19:01:00	25564	138083	6416	Account
                    765949	Marketing	Macy's	3/31/14	4/5/14 20:02:00	119581	723706	28973	More artists, more music
                    150103	Transactional	LinkedIn	3/31/14	4/5/14 12:42:00	24402	139424	5625	Order
                    137842	Transactional	eHarmony	3/31/14	4/5/14 14:20:00	19486	129853	5016	Trial
                    483092	Marketing	Netflix	3/31/14	4/5/14 2:04:00	45111	441225	11600	Rewards
                    241788	Transactional	Facebook	3/31/14	4/4/14 5:14:00	60885	216312	14872	Trial
                    1141743	Marketing	Target	3/31/14	4/4/14 13:55:00	265051	1060277	65076	More artists, more music
                    193891	Transactional	Overstock	3/31/14	4/4/14 6:17:00	34931	181405	9019	Account
                    168199	Transactional	Wells Fargo	3/31/14	4/4/14 21:26:00	29859	151303	7038	Order
                    816831	Marketing	Costco	3/31/14	4/4/14 1:48:00	132165	743126	32943	More artists, more music
                    756574	Marketing	Macy's	3/31/14	4/4/14 4:50:00	121110	673001	30000	More artists, more music
                    753745	Marketing	LinkedIn	3/31/14	4/4/14 19:54:00	121300	689765	30644	30% off Limited Sale
                    140765	Transactional	eHarmony	3/31/14	4/4/14 18:42:00	19478	128922	5225	Newsletter
                    96609	Transactional	Netflix	3/31/14	4/4/14 13:39:00	9120	89301	2332	Trial
                    228272	Transactional	Facebook	3/31/14	4/3/14 14:56:00	57490	203542	14492	Order
                    228638	Transactional	Target	3/31/14	4/3/14 20:48:00	49694	207002	13550	Newsletter
                    175200	Transactional	Overstock	3/31/14	4/3/14 14:53:00	33886	159981	7949	Account
                    851569	Marketing	Wells Fargo	3/31/14	4/3/14 16:34:00	140941	802598	36984	30% off Limited Sale
                    152323	Transactional	Costco	3/31/14	4/3/14 20:01:00	26471	140505	6076	Account
                    153101	Transactional	Macy's	3/31/14	4/3/14 12:47:00	24422	136402	6286	Account
                    769557	Marketing	LinkedIn	3/31/14	4/3/14 19:49:00	115177	716390	31024	More artists, more music
                    685811	Marketing	eHarmony	3/31/14	4/3/14 11:54:00	100499	631240	24935	30% off Limited Sale
                    92825	Transactional	Netflix	3/31/14	4/3/14 21:57:00	8872	89408	2240	Trial
                    247694	Transactional	Facebook	3/31/14	4/2/14 13:38:00	59302	234053	14912	Order
                    1134942	Marketing	Target	3/31/14	4/2/14 5:36:00	271382	1044695	62186	30% off Limited Sale
                    184780	Transactional	Overstock	3/31/14	4/2/14 20:31:00	35245	163371	8406	Trial
                    164343	Transactional	Wells Fargo	3/31/14	4/2/14 21:43:00	28871	146737	7290	Order
                    154470	Transactional	Costco	3/31/14	4/2/14 16:43:00	26498	147496	6718	Trial
                    151798	Transactional	Macy's	3/31/14	4/2/14 1:14:00	22876	141343	5683	Account
                    153808	Transactional	LinkedIn	3/31/14	4/2/14 19:06:00	23203	143560	5853	Newsletter
                    711054	Marketing	eHarmony	3/31/14	4/2/14 19:26:00	103669	659913	24045	More artists, more music
                    92778	Transactional	Netflix	3/31/14	4/2/14 5:43:00	8847	83074	2114	Newsletter
                    1195988	Marketing	Facebook	3/31/14	4/1/14 16:14:00	295502	1045824	71109	30% off Limited Sale
                    1062723	Marketing	Target	3/31/14	4/1/14 4:48:00	245303	972240	57830	Rewards
                    866545	Marketing	Overstock	3/31/14	4/1/14 9:43:00	172339	781746	39369	More artists, more music
                    166468	Transactional	Wells Fargo	3/31/14	4/1/14 17:15:00	27014	151976	7008	Trial
                    807840	Marketing	Costco	3/31/14	4/1/14 15:44:00	127857	741497	34304	30% off Limited Sale
                    747463	Marketing	Macy's	3/31/14	4/1/14 17:21:00	114595	709684	28472	30% off Limited Sale
                    150712	Transactional	LinkedIn	3/31/14	4/1/14 3:42:00	22535	140797	6034	Account
                    133403	Transactional	eHarmony	3/31/14	4/1/14 14:50:00	18837	124645	4789	Account
                    95628	Transactional	Netflix	3/31/14	4/1/14 6:02:00	8765	87165	2275	Order
                    1125694	Marketing	Facebook	3/24/14	3/31/14 3:26:00	290455	1020595	72951	Rewards
                    1128984	Marketing	Target	3/24/14	3/31/14 3:23:00	262420	1017237	62040	30% off Limited Sale
                    171964	Transactional	Overstock	3/24/14	3/31/14 9:53:00	31756	153093	7853	Trial
                    823309	Marketing	Wells Fargo	3/24/14	3/30/14 23:45:00	143581	750086	36591	Rewards
                    761997	Marketing	Costco	3/24/14	3/31/14 6:13:00	131520	690801	32318	Rewards
                    152882	Transactional	Macy's	3/24/14	3/31/14 21:55:00	23439	147126	5910	Account
                    734719	Marketing	LinkedIn	3/24/14	3/31/14 21:47:00	117744	672492	28599	Rewards
                    134676	Transactional	eHarmony	3/24/14	3/31/14 17:07:00	19928	129462	4708	Order
                    89183	Transactional	Netflix	3/24/14	3/31/14 17:10:00	8881	83540	2059	Newsletter
                    234950	Transactional	Facebook	3/24/14	3/30/14 0:03:00	59851	217121	14563	Account
                    217577	Transactional	Target	3/24/14	3/30/14 12:49:00	50843	199867	11927	Account
                    186544	Transactional	Overstock	3/24/14	3/30/14 17:13:00	33971	177925	8904	Newsletter
                    162586	Transactional	Wells Fargo	3/24/14	3/30/14 10:32:00	28924	156509	6871	Account
                    150738	Transactional	Costco	3/24/14	3/30/14 14:30:00	24331	137702	6273	Trial
                    736409	Marketing	Macy's	3/24/14	3/30/14 16:36:00	116434	668503	28575	30% off Limited Sale
                    748648	Marketing	LinkedIn	3/24/14	3/30/14 11:23:00	112509	708428	28043	30% off Limited Sale
                    137796	Transactional	eHarmony	3/24/14	3/30/14 9:52:00	18749	121896	4726	Trial
                    468381	Marketing	Netflix	3/24/14	3/29/14 23:44:00	45673	431272	10592	Rewards
                    229435	Transactional	Facebook	3/24/14	3/29/14 17:03:00	59124	214575	13601	Order
                    1076236	Marketing	Target	3/24/14	3/29/14 3:23:00	251761	986007	58394	More artists, more music
                    185199	Transactional	Overstock	3/24/14	3/29/14 3:40:00	35462	161935	8755	Order
                    163743	Transactional	Wells Fargo	3/24/14	3/29/14 14:17:00	27547	157406	7149	Trial
                    788572	Marketing	Costco	3/24/14	3/29/14 8:01:00	127742	709438	31619	More artists, more music
                    146745	Transactional	Macy's	3/24/14	3/29/14 21:53:00	22200	134588	5544	Account
                    141393	Transactional	LinkedIn	3/24/14	3/29/14 22:40:00	21211	124102	5329	Order
                    132071	Transactional	eHarmony	3/24/14	3/29/14 22:06:00	18994	121951	4894	Trial
                    88658	Transactional	Netflix	3/24/14	3/29/14 3:09:00	8281	83004	2104	Newsletter

Cloud9QL Query:

select sum(sent) as Total Sent, Customer, Message Type
group by Customer, Message Type
order by Total Sent desc

Without GROUP BY

 select sum(sent)
 select sum(sent), avg(sent), count(*), median(sent), max(sent), min(sent)

Supported: sum, count, avg, median, max, min

With GROUP BY

Enables aggregations based on one or more groups/dimensions.

  select sum(sent) as Total Sent, Customer group by Customer

HAVING - Filtering Aggregated Results

The HAVING clause filters results after GROUP BY aggregation, unlike WHERE which filters before aggregation.

  select Customer, sum(sent) as Total_Sent 
  group by Customer 
  having sum(sent) > 100000

You can also use column aliases in HAVING:

  select Customer, sum(sent) as Total_Sent 
  group by Customer 
  having Total_Sent > 100000

Multiple conditions can be combined:

  select Customer, sum(sent) as Total, avg(opened) as Avg_Opened
  group by Customer
  having Total > 50000 and Avg_Opened > 1000
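The difference between filtering before and after aggregation can be seen in a small Python sketch. This is an illustration of the semantics described above, not Cloud9QL's implementation. The four rows are taken from the sample dataset, and the helper name `total_sent` is hypothetical.

```python
from collections import defaultdict

rows = [
    {"Customer": "Facebook", "sent": 239232},
    {"Customer": "Facebook", "sent": 246717},
    {"Customer": "Netflix", "sent": 96609},
    {"Customer": "Netflix", "sent": 92825},
]

def total_sent(rows):
    """Equivalent of: select Customer, sum(sent) group by Customer."""
    totals = defaultdict(int)
    for r in rows:
        totals[r["Customer"]] += r["sent"]
    return dict(totals)

# WHERE: drop individual rows first, then aggregate what is left.
where_result = total_sent([r for r in rows if r["sent"] > 100000])

# HAVING: aggregate every row first, then drop groups by their total.
having_result = {c: t for c, t in total_sent(rows).items() if t > 100000}

print(where_result)   # {'Facebook': 485949}
print(having_result)  # {'Facebook': 485949, 'Netflix': 189434}
```

Netflix disappears under the WHERE-style filter because neither of its rows exceeds 100000 on its own, while the HAVING-style filter keeps it because its total does.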

In Analyze Mode UI: The HAVING functionality is available through the "Aggregation Filters" section in the Data Transformation tab. Simply drag aggregated metrics into this section to filter on aggregated values.

Window Functions (OVER with PARTITION BY)

Window functions perform calculations across a set of rows that are related to the current row, similar to aggregate functions but without collapsing the rows into a single output row. Currently, only aggregate window functions are supported.

Syntax

AGGREGATE_FUNCTION(column) OVER (PARTITION BY partition_column[, ...])
AGGREGATE_FUNCTION(column) OVER ()

Supported Window Functions

SUM(column) OVER (PARTITION BY ...) - Sum values within each partition
AVG(column) OVER (PARTITION BY ...) - Average values within each partition
COUNT(*) OVER (PARTITION BY ...) - Count rows within each partition
MAX(column) OVER (PARTITION BY ...) - Maximum value within each partition
MIN(column) OVER (PARTITION BY ...) - Minimum value within each partition

All window functions also support OVER () without PARTITION BY to calculate over all rows.

Examples

Get total sales per category for each row:

select id, category, amount, SUM(amount) OVER (PARTITION BY category) as category_total

Get maximum salary per department:

select employee_id, department, salary, MAX(salary) OVER (PARTITION BY department) as max_dept_salary

Count orders per customer:

select order_id, customer, order_date, COUNT(*) OVER (PARTITION BY customer) as customer_order_count

Get grand total for all rows:

select id, amount, SUM(amount) OVER () as grand_total

Mix window functions with and without partitions:

select id, region, sales, 
       SUM(sales) OVER () as total_sales,
       SUM(sales) OVER (PARTITION BY region) as regional_sales

Multiple partitions:

select year, quarter, region, revenue, SUM(revenue) OVER (PARTITION BY year, quarter) as quarter_total

Multiple window functions in one query:

select product, region, sales,
       SUM(sales) OVER (PARTITION BY region) as region_total,
       AVG(sales) OVER (PARTITION BY product) as product_avg

Note:

Window functions with ORDER BY inside the OVER() clause are not currently supported.
The RANK() window function is not currently supported.
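To make the contrast with GROUP BY concrete, the following Python sketch mimics SUM(amount) OVER (PARTITION BY category) and SUM(amount) OVER () on a few made-up rows. It is a sketch of the behavior described in this section, not the engine's actual algorithm.

```python
from collections import defaultdict

# Made-up rows for this sketch.
rows = [
    {"id": 1, "category": "A", "amount": 10},
    {"id": 2, "category": "A", "amount": 20},
    {"id": 3, "category": "B", "amount": 5},
]

# First pass: one total per partition (category).
totals = defaultdict(int)
for r in rows:
    totals[r["category"]] += r["amount"]

# OVER () treats the whole dataset as a single partition.
grand_total = sum(r["amount"] for r in rows)

# Second pass: every input row is kept and annotated with its partition's total.
windowed = [
    {**r, "category_total": totals[r["category"]], "grand_total": grand_total}
    for r in rows
]
for r in windowed:
    print(r)
```

The output has the same number of rows as the input, whereas a GROUP BY on category would return one row per category.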

ARRAY

Combines values on multiple rows of a given field into an array based on group by field(s).

ARRAY(<field>)
ARRAY(<field>, <remove-duplicates-flag>)
ARRAY(<field>, <remove-duplicates-flag>, <filter-out-null-flag>)

select Stock, array(Price) as Trends group by Stock
select Stock, array(Price, true, true) as Trends group by Stock
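A Python sketch of what the two flags do may help. The function `array_agg` and the sample rows are hypothetical. Keeping the first occurrence of each value in its original order is an assumption of this sketch, not documented Cloud9QL behavior.

```python
from collections import defaultdict

def array_agg(rows, group_field, value_field, remove_duplicates=False, filter_nulls=False):
    """Collect value_field into one list per distinct group_field value."""
    groups = defaultdict(list)
    for r in rows:
        value = r[value_field]
        if filter_nulls and value is None:
            continue  # skip null values when the third flag is set
        if remove_duplicates and value in groups[r[group_field]]:
            continue  # skip values already collected when the second flag is set
        groups[r[group_field]].append(value)
    return dict(groups)

# Made-up rows for this sketch.
rows = [
    {"Stock": "ABC", "Price": 10},
    {"Stock": "ABC", "Price": 10},
    {"Stock": "ABC", "Price": None},
    {"Stock": "ABC", "Price": 12},
    {"Stock": "XYZ", "Price": 5},
]

plain = array_agg(rows, "Stock", "Price")
cleaned = array_agg(rows, "Stock", "Price", remove_duplicates=True, filter_nulls=True)
print(plain)    # {'ABC': [10, 10, None, 12], 'XYZ': [5]}
print(cleaned)  # {'ABC': [10, 12], 'XYZ': [5]}
```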

Functions

Cloud9QL Query:

select opened, sent, sent-opened as Unresponsive, (opened/sent)*100 as Open Rate

Arithmetic

Arithmetic Operations

Arithmetic operations can be used within the query

select (opened/sent)*100 as Open Rate, Customer

Supported operators and functions:

+
-
*
/
^
%
abs
acos
asin
atan
cbrt
ceil
cos
cosh
floor
sqrt
tan

Standard Deviation

Useful for measuring how widely a set of values is spread around its mean:

select sd(opened) as Std Deviation, Customer group by customer

Date Operations

Cloud9QL will automatically attempt to parse various date formats.

Use str_to_date(<date>,<format>) for unsupported formats.

Cloud9QL Query:

select date(date) as Date Midnight, date(week) as Start of Week, week_of_year(date) as Week Number, quarter(date) as Quarter Start, now() as Current Date/Time

EPOCH_SECS

Distinguishes epoch timestamps in seconds from epoch timestamps in milliseconds. This applies specifically to REST queries.

For example: date={$c9today,epochsecs} formats today's date as epoch seconds.

EPOCH_TO_DATE

Converts an Epoch number of seconds to a readable date format.

select epoch_to_date(date) as datetime

DATE

Truncates a date to midnight. When used within group by performs aggregation by date.

select date(date), sent
select date(date) as Sent Date, sum(sent) as Total Sent group by date(date)

DAY_OF_WEEK

Day name of the week (Sunday, Monday etc)

select day_of_week(date), sum(sent) as Total Sent group by day_of_week(date)

DAY_OF_MONTH

Day of the month (1, 2, 3 etc)

select day_of_month(date), sum(sent) as Total Sent group by day_of_month(date)

DAYS_IN_MONTH

Length of the month in days (28, 29, 30, 31 etc)

select days_in_month(date), sum(sent) as Total Sent group by days_in_month(date)

WEEK

Truncates a date to the beginning of the week (Sunday). When used within group by performs aggregation by week.

select week(date) as Sent Week, sum(sent) as Total Sent group by week(date)

By default, WEEK(<date-field>) returns the Sunday date at midnight of a given week but with WEEK(<date-field>, offset), you can offset the days and alter the day of the week returned.

Offset Reference:

1 - Monday

2 - Tuesday

3 - Wednesday

4 - Thursday

5 - Friday

6 - Saturday

Example: WEEK(<date-field>, 1): returns the Monday date at midnight of a given week.
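The week truncation can be sketched in Python as follows. The helper name `week_start` is hypothetical. The sketch assumes the offset is simply added, in days, to the Sunday that starts the week, which is one reading of the description above and may not match Cloud9QL exactly for every input.

```python
from datetime import date, timedelta

def week_start(d, offset=0):
    """Return the Sunday on or before d, shifted forward by offset days (assumed semantics)."""
    # Python's weekday() is Monday=0..Sunday=6; shift so that Sunday=0.
    days_since_sunday = (d.weekday() + 1) % 7
    sunday = d - timedelta(days=days_since_sunday)
    return sunday + timedelta(days=offset)

tuesday = date(2014, 4, 8)
print(week_start(tuesday))     # 2014-04-06, the Sunday of that week
print(week_start(tuesday, 1))  # 2014-04-07, the Monday of that week
```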

WEEK_OF_YEAR

Week Number integer for the input date

select week_of_year(date) as Sent Week, sum(sent) as Total Sent group by week_of_year(date)

NOTES:

A week is defined from Monday-Sunday regardless of month or year.
All weeks are 7 days long
Weeks do not depend on the month, so the days of one month can be associated with 5 different weeks depending on how the days are aligned. For example, if the 1st of a month falls on a Saturday, that day belongs to the week starting on the Monday that falls in the previous month.
The first week of a year will always follow the last week in December of the previous year

MONTH

Truncates to the 1st of the month. Aggregates data on a monthly basis when used within group by.

select month(date) as Sent Month, sum(sent) as Total Sent group by month(date)

MONTH_OF_YEAR

Month of the year (1, 2, 7, 12 etc)

select month_of_year(date), sum(sent) as Total Sent group by month_of_year(date)

QUARTER

Truncates to the beginning of the quarter. Aggregates data on a quarterly basis when used within group by.

select quarter(date) as Sent Quarter, sum(sent) as Total Sent group by quarter(date)

YEAR

Truncates to the 1st of the year. Aggregates data on a yearly basis when used within group by.

select year(date) as Sent Year, sum(sent) as Total Sent group by year(date)

HOUR

Truncates to the hour for dates with timestamps

select HOUR(date) as Sent Hour, sum(sent) as Total Sent group by hour(date)

MINUTE

Truncates/Groups to the minute for dates with timestamps

select MINUTE(timestamp) as Sent Minute, sum(sent) as Total Sent group by MINUTE(timestamp)

NOW

Current date/time

select now()

DATE_FORMAT

Converts a date into another format

DATE_FORMAT(<date>,<format>)

select date_format(date,dd-MMM) as Display Format

Options:


y	Year
M	Month
w	Week of Year
W	Week in month
D	Day in Year
d	Day in Month
F	Day of Week in Month
E	Day name in week. Example: Tuesday,Tue
a	Am/PM marker
H	Hour in day (0-23)
h	Hour in am/pm (1-12)
m	Minute in hour
s	Second in minute
S	Millisecond
z	Time zone
Z	Time zone

DATE_ADD

Add a datetime amount to a date

DATE_ADD(<date>,<amount>)

select date_add(date,+1y) as Date

STR_TO_DATE

Converts a string into a date using the provided format.

STR_TO_DATE(<date>,<format>)

select str_to_date(date,dd-MMM-yy HH:mm) as Converted Date

Date Tokens

The following reserved tokens enable date queries based on current date/time:

$c9_now	Current Time
$c9_thishour	00:00 of the Current hour
$c9_today	Midnight of the current date
$c9_yesterday	Midnight, yesterday
$c9_thisweek	Start of the current week (Sunday midnight)
$c9_lastweek	Start of last week (Sunday midnight)
$c9_thismonth	Midnight of the 1st of the current month
$c9_lastmonth	Midnight of the 1st of the last month
$c9_thisquarter	Midnight of the 1st of the current quarter (Jan, April, July, Oct)
$c9_lastquarter	Midnight of the 1st of the last quarter (Jan, April, July, Oct)
$c9_thisyear	Midnight, Jan 1, of the current year
$c9_lastyear	Midnight, Jan 1, of the last year

select * where date > $c9_thisyear

In addition, these can be further manipulated with +/- operands along with time unit identifiers. For example:

select * where date > $c9_thisyear+2m

Gets data from March onwards

select * where date > $c9_yesterday+2h

Data from 2:00 AM yesterday
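
The token arithmetic above can be sketched in Python. This is only an illustration of the date math for a fixed, hypothetical "now"; the helper names (add_months, this_year, yesterday) are assumptions made for this sketch and are not part of Cloud9QL.

```python
from datetime import datetime, timedelta

def add_months(dt, months):
    # Shift by whole months. Safe here because the tokens land on day 1;
    # a day such as the 31st could raise ValueError in a shorter month.
    total = dt.year * 12 + (dt.month - 1) + months
    return dt.replace(year=total // 12, month=total % 12 + 1)

def this_year(now):
    # Midnight, Jan 1, of the current year (like $c9_thisyear)
    return datetime(now.year, 1, 1)

def yesterday(now):
    # Midnight, yesterday (like $c9_yesterday)
    return datetime(now.year, now.month, now.day) - timedelta(days=1)

now = datetime(2015, 7, 10, 14, 30)  # hypothetical current time

march_first = add_months(this_year(now), 2)             # $c9_thisyear+2m
two_am_yesterday = yesterday(now) + timedelta(hours=2)  # $c9_yesterday+2h
```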

Time Units

The following time units are supported:

min	Minutes
h	Hours
d	Days
w	Weeks
m	Months
q	Quarters
y	Years

Timezones

Default timezone is US/Pacific for date display within Knowi. On-premise agents inherit the server timezone.

Custom Timezones can be set in the query using:

set time_zone=US/Eastern;

Full list of Timezones here.

Example:

Cloud9QL Query:

set time_zone=UTC;
select sent, date,$c9_yesterday,now()

Date Deltas

Date delta functions calculate the amount of time between two date-time values, expressed in a given date/time unit. The result is a positive whole number, even if the end is before the start.

For example, number of minutes between two date-times:

MINUTES_DELTA(<date field>,<date field>)

select minutes_delta('02/28/2015 22:25:34', '01/28/2015 16:28:34') as minutes_delta;
select minutes_delta(now(), date) as minutes_delta;

Number of hours:

HOURS_DELTA(<date field>,<date field>)

select hours_delta('02/28/2015 22:25:34', '01/28/2015 16:28:34') as hours_delta;
select hours_delta(now(), date) as hours_delta;

Days:

DAYS_DELTA(<date field>,<date field>)

select days_delta('02/28/2015 22:25:34', '01/28/2015 16:28:34') as days_delta;
select days_delta(now(), date) as days_delta;

Months:

MONTHS_DELTA(<date field>,<date field>)

select months_delta('02/28/2015 22:25:34', '01/28/2015 16:28:34') as months_delta;
select months_delta(now(), date) as months_delta;
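
As a rough cross-check of the literal examples above, the same differences can be computed in Python. This sketch assumes that partial units are truncated to a whole number; the text states only that the result is a positive whole number, so the exact rounding rule is an assumption.

```python
from datetime import datetime, timedelta

FMT = "%m/%d/%Y %H:%M:%S"
first = datetime.strptime("02/28/2015 22:25:34", FMT)
second = datetime.strptime("01/28/2015 16:28:34", FMT)

# Absolute gap: the argument order does not change the sign.
gap = abs(first - second)

# Floor division by a unit-sized timedelta gives whole units (assumed truncation).
minutes_delta = gap // timedelta(minutes=1)
hours_delta = gap // timedelta(hours=1)
days_delta = gap // timedelta(days=1)
```

Under that assumption, the gap of 31 days, 5 hours, and 57 minutes comes out as 44997 minutes, 749 hours, or 31 days.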

Rolling & Cumulative

Cloud9QL provides a set of operations which can be utilized to calculate rolling and cumulative operations such as accumulate, growth, delta, simple moving average, cumulative moving average, and time moving average.

The standard use case for these operations is to compute an <operation> on a <value field> across a set of <dimension field(s)>, optionally grouping by a set of <grouping field(s)>.

<operation>(<value field>[, <grouping field(s)>]);

For example, compute the DELTA of Sent across Week grouping by Customer. In this example:

<operation>: DELTA
<value field>: Sent
<dimension field(s)>: Week
<grouping field(s)>: Customer

Example:

select Customer, delta(Sent, Customer) as SentDelta

There is one important restriction when using these Cloud9QL functions: the input data needs to be ordered by the <grouping field(s)> and <dimension field(s)>, in that order.

Cloud9QL Query:

select sum(Sent) as Sent, Customer, Week
group by Customer, Week
order by Customer, Week;
select Customer, delta(Sent, Customer) as SentDelta
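
The ordering restriction is easier to see in a small Python sketch of a grouped delta. The rows and the grouped_delta helper are hypothetical, and returning None for the first record of each group is an assumption; the source does not say what Cloud9QL emits there.

```python
from itertools import groupby

# Hypothetical input, already ordered by Customer and then Week.
rows = [
    {"Customer": "Facebook", "Week": 1, "Sent": 100},
    {"Customer": "Facebook", "Week": 2, "Sent": 130},
    {"Customer": "Target", "Week": 1, "Sent": 500},
    {"Customer": "Target", "Week": 2, "Sent": 450},
]

def grouped_delta(rows, value, group):
    out = []
    # groupby only merges adjacent rows, which is why the sort order matters.
    for _, members in groupby(rows, key=lambda r: r[group]):
        prev = None
        for r in members:
            out.append(None if prev is None else r[value] - prev)
            prev = r[value]
    return out

sent_delta = grouped_delta(rows, "Sent", "Customer")
```

If the rows were not sorted by Customer first, the differences would be taken between unrelated customers.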

ACCUMULATE

Creates cumulative totals for a field between records, given a sorted dataset.

accumulate(<value field>[, <grouping field(s)>]);

select accumulate(sent), date

The above example returns a cumulative sum of sent count for a pre-sorted date order.
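
For intuition, a cumulative sum over a pre-sorted list looks like this in Python. The values are made up for illustration.

```python
from itertools import accumulate

sent = [100, 130, 90]  # hypothetical values, already in date order

# Each entry is the sum of all values up to and including that row.
running_total = list(accumulate(sent))
```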

GROWTH

Calculates a growth percentage for a field between records, for a sorted dataset.

growth(<value field>[, <grouping field(s)>]);

 select growth(sent), date
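
A minimal Python sketch of a growth calculation follows. It assumes growth means the percentage change relative to the previous record, and that the first record has no growth value; neither detail is spelled out in the text above, so treat both as assumptions.

```python
def growth(values):
    out = [None]  # assumption: no previous record, so no growth value
    for prev, curr in zip(values, values[1:]):
        out.append((curr - prev) * 100 / prev)
    return out

sent = [100, 150, 120]  # hypothetical values, already sorted
growth_pct = growth(sent)
```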

DELTA

Calculates a difference for a field between records, for a sorted dataset.

delta(<value field>[, <grouping field(s)>]);

 select delta(sent), date

SMA

Simple moving average based on a field and a window size for it. Assumes a sorted dataset.

SMA(<value field>, <window size>[, <grouping field(s)>]);

select sma(sent, 10)

CMA

Cumulative moving average returns the moving average of all data up to the current data point.

CMA(<value field>[, <grouping field(s)>]);

select cma(sent)
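
The difference between the simple and cumulative moving averages can be sketched in Python. The sample values are hypothetical, and averaging over fewer points at the start of the series (before a full window is available) is an assumption about edge handling, not documented behavior.

```python
def sma(values, window):
    out = []
    for i in range(len(values)):
        # Last `window` values up to and including position i.
        chunk = values[max(0, i - window + 1): i + 1]
        out.append(sum(chunk) / len(chunk))
    return out

def cma(values):
    out, total = [], 0
    for n, v in enumerate(values, start=1):
        total += v
        out.append(total / n)  # average of everything seen so far
    return out

sent = [10, 20, 30, 40]
sma_2 = sma(sent, 2)
cma_all = cma(sent)
```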

TMA

Time moving average based on a field, date field, and a window time unit size for it. See Time Units for all available time units. Assumes a sorted dataset.

TMA(<value field>, <date field>, <time unit window>[, <grouping field(s)>]);

select tma(sent, date, 1w)

For more details on moving average definitions, see http://en.wikipedia.org/wiki/Moving_average

TMS

Time moving sum based on a field, date field, and window time unit size.

TMS(<value field>, <date field>, <time unit window>[, <grouping field(s)>]);

select tms(sent, date, 1w)

RANK

Rank of records, given a sorted dataset.

rank([<grouping field(s)>]);

select rank(), date

The above example returns the rank (increment by 1) of each row for a pre-sorted date order.

String & Number Operators

ROUND

Rounds a number to the specified number of decimal places.

ROUND(<field>, <decimal points>)

select round(sent,1)

SUBSTRING

Substring between start and end indexes.

SUBSTRING(<field to check against>, <startIndex>, <length>)

 select substring(Message Type,0,10)

SUBSTRING_BEFORE

Substring before the first occurrence of a delimiter for a field value.

SUBSTRING_BEFORE(<field to check against>, <delimiter>)

 select substring_before(Message Type,someDelim)

SUBSTRING_AFTER

Substring after the first occurrence of a delimiter for a field value.

SUBSTRING_AFTER(<field to check against>, <delimiter>)

  select substring_after(Message Type,someDelim)

CONCAT

Concatenates multiple columns together. When a field name does not exist in the current dataset, a fixed string is used.

CONCAT(<field name>, <anotherfield>, <yetanotherfield>, ...)

 select concat(Customer, for Week of, Week)

CONV

For converting hex strings into integers.

CONV (<field name>, <field radix>, <desired radix>)

 select conv(hexfield, 16, 10)

SPLIT

Split a string of elements separated by separator into an array. If separator is not specified, comma will be used.

SPLIT(<field name>, <separator>)

 select split(Customer, ",")

ARRAY_TO_STRING

Join elements of an array value together separated by separator. When a field name does not exist in the current dataset, a fixed string is used.

ARRAY_TO_STRING(<field name>, <separator>)

 select array_to_string(Customer, ", ")

UPPER

Upper cases a string

UPPER(<field name>)

 select upper(Customer)

LOWER

Lower cases a string

LOWER(<field name>)

 select lower(Customer)

TRIM

Removes leading and trailing spaces from a string

TRIM(<field name>)

 select trim(address)

LENGTH

Returns the length of a string.

LENGTH(<field name>)

 select length(address)

CURRENCY_FORMAT

Formats a number to a locale specific currency format. Defaults to US currency format (en_US) if locale is not specified.

CURRENCY_FORMAT(<field name>, <locale>)

CURRENCY_FORMAT(<field name>, <decimal points>)

CURRENCY_FORMAT(<field name>, <locale>, <decimal points>)

  select currency_format(revenue)

Example with Locale:

  select currency_format(revenue,en-GBP)

NUMBER_FORMAT

This function allows you to control the display of leading and trailing zeros, prefixes and suffixes, grouping (thousands) separators, and the decimal separator.

NUMBER_FORMAT(<number>,<format>)

select number_format(clicks,##,###.00) as Number of clicks

The following table shows sample patterns and their output. The value is the number (a double) to be formatted. The pattern is the string that specifies the formatting properties. The output, a string, is the formatted number:

value	pattern	output	explanation
123456.789	###,###.###	123,456.789	The pound sign (#) denotes a digit, the comma is a placeholder for the grouping separator, and the period is a placeholder for the decimal separator.
123456.789	###.##	123456.79	The value has three digits to the right of the decimal point, but the pattern has only two. The format method handles this by rounding up.
123.78	000000.000	000123.780	The pattern specifies leading and trailing zeros, because the 0 character is used instead of the pound sign (#).
12345.67	$###,###.###	$12,345.67	The first character in the pattern is the dollar sign ($). Note that it immediately precedes the leftmost digit in the formatted output.
12345.67	\u00A5###,###.###	¥12,345.67	The pattern specifies the currency sign for Japanese yen (¥) with the Unicode value 00A5.

REGEX_REPLACE

Replaces each substring of this string that matches the given regular expression with the given replacement.

In case replacement parameter is not provided an empty string value "" is used as default replacement.

REGEX_REPLACE(<field name>, <regex>)
REGEX_REPLACE(<field name>, <regex>, <replacement>)

For example, to remove all whitespace from a string:

  select regex_replace('Morris Park Bake Shop', '\s') as regex_replaced; 
  ==> MorrisParkBakeShop
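
The same replacement can be reproduced with Python's re module as a quick sanity check of the pattern. Python's regex dialect may differ from Cloud9QL's in places, so this is only an approximation.

```python
import re

# An empty replacement string mirrors the default when no replacement is given.
regex_replaced = re.sub(r"\s", "", "Morris Park Bake Shop")
```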

REGEX_EXTRACT

Extract and return all matches (non-overlapped) for the regular expression from the given input string field.

In case there is no match, NULL will be returned.

REGEX_EXTRACT(<field name>, <regex>, [<extract groups>])

For example, to extract all string occurrences between (and including) '|' characters:

  select regex_extract("|Morris Park| |Bake Shop|", "\|([^|]*)\|");
  ==> ["|Morris Park|","|Bake Shop|"]
  select regex_extract("|Morris Park| |Bake Shop|", "\|(([^|]*))\|", true); 
  ==> [ ["|Morris Park|","Morris Park"],["|Bake Shop|","Bake Shop"] ]
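
A rough Python equivalent of the two calls above is shown below. It uses a single capture group and Python's re module, whose dialect may differ from Cloud9QL's. The extract helper is hypothetical and only mirrors the documented behavior of returning NULL when nothing matches.

```python
import re

def extract(text, pattern, with_groups=False):
    found = list(re.finditer(pattern, text))
    if not found:
        return None  # mirrors the NULL result for no match
    if with_groups:
        # Full match first, then each captured group.
        return [[m.group(0), *m.groups()] for m in found]
    return [m.group(0) for m in found]

text = "|Morris Park| |Bake Shop|"
pattern = r"\|([^|]*)\|"

matches = extract(text, pattern)
matches_with_groups = extract(text, pattern, with_groups=True)
```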

IFNULL

Returns an alternate value to be used in case the specified field does not exist or the value is NULL.

IFNULL(<field name>, <alternate value>)

  select IFNULL(CustomerName, "N/A")

You can also specify an alternate column in place of an alternate value

IFNULL(<field name>, <another field name>)

  select IFNULL(CustomerName, CustomerId)

Other

LAG

Useful to access data from a previous row with an optional row offset.

LAG(<field>[, offset[, default]])

  select LAG(sales, 5)        -- Get sales from 5 rows behind
  select LAG(sales, 3, 0)     -- Get sales from 3 rows behind, default to 0 if none

Users can also group data in the LAG function to look behind within partitions.

LAG(<field>[, offset[, default[, <grouping field(s)>]]])

  select LAG(sales, 1, NONE, customer)     -- Get sales from 1 row behind, default to NONE if none, group by customer

LEAD

Useful to access data from a subsequent row with an optional row offset (opposite of LAG).

LEAD(<field>[, offset[, default]])

  select LEAD(sales, 1)        -- Get next row's sales value
  select LEAD(sales, 2, 0)     -- Get sales from 2 rows ahead, default to 0 if none

Users can also group data in the LEAD function to look ahead within partitions.

LEAD(<field>[, offset[, default[, <grouping field(s)>]]])

  select LEAD(sales, 1, 0, customer)     -- Get sales from 1 row ahead, default to 0 if none, group by customer

Example combining LAG and LEAD:

select
    date,
    sales,
    product,
    LAG(sales, 1, 0, product) as prev_sales,
    LEAD(sales, 1, 0, product) as next_sales,
    LEAD(sales, 1, 0, product) - LAG(sales, 1, 0, product) as change_win
    ORDER BY product, date
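
To make the partitioned look-behind and look-ahead concrete, here is a small Python sketch. The lag and lead helpers and the sample rows are hypothetical, and the sketch assumes the rows are already ordered by product and then date.

```python
def lag(rows, field, offset=1, default=None, group=None):
    out = []
    for i, row in enumerate(rows):
        j = i - offset
        in_range = 0 <= j < len(rows)
        # Only use the other row if it belongs to the same partition.
        same_partition = in_range and (group is None or rows[j][group] == row[group])
        out.append(rows[j][field] if same_partition else default)
    return out

def lead(rows, field, offset=1, default=None, group=None):
    # Looking ahead is looking behind by a negative offset.
    return lag(rows, field, -offset, default, group)

rows = [
    {"product": "A", "sales": 10},
    {"product": "A", "sales": 15},
    {"product": "A", "sales": 12},
    {"product": "B", "sales": 7},
    {"product": "B", "sales": 9},
]

prev_sales = lag(rows, "sales", 1, 0, "product")
next_sales = lead(rows, "sales", 1, 0, "product")
change_win = [n - p for n, p in zip(next_sales, prev_sales)]
```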

CASE WHEN (IF..ELSE)

CASE WHEN statements provide great flexibility when dealing with buckets of results or when you need to filter out certain results. They are conditional logic, similar to IF-THEN statements in other programming languages.

When using a CASE WHEN statement, it's important to remember you need a condition, what to do when that condition is met, and an END clause. A simple example is below:

CASE 
    WHEN condition
        THEN result
    ELSE other_result
END

For example,

SELECT
    CASE
      WHEN country = 'USA'
        THEN 'North America'
      WHEN country = 'Mexico'
        THEN 'North America'
      ELSE country
END AS 'West Region'
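
For readers more familiar with general-purpose languages, the example above reads like this if/else chain in Python. The function name is made up for illustration.

```python
def west_region(country):
    # Each WHEN ... THEN pair becomes one branch; ELSE is the fallback.
    if country == "USA":
        return "North America"
    elif country == "Mexico":
        return "North America"
    else:
        return country

regions = [west_region(c) for c in ["USA", "Mexico", "Brazil"]]
```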

PERCENTILE

Returns the value of the field for the specified percentile rank.

PERCENTILE(<field>, <percentile>)

select percentile(sent,75)

TRANSPOSE

Pivots row values for a field to columns

TRANSPOSE(<field to transpose>, <value column>)

select transpose(Message Type,Sent)

To collapse based on a key field, use:

TRANSPOSE(<field to transpose>, <current column name>, <key column name>)

select transpose(Message Type, Sent, Customer)
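
A minimal Python sketch of the keyed form follows. The transpose helper is hypothetical and the rows are a few values taken from the sample data. How Cloud9QL fills cells that have no source row is not stated here, so the sketch simply leaves those keys out.

```python
rows = [
    {"Customer": "Facebook", "Message Type": "Transactional", "Sent": 239232},
    {"Customer": "Target", "Message Type": "Marketing", "Sent": 1151481},
    {"Customer": "LinkedIn", "Message Type": "Marketing", "Sent": 784686},
    {"Customer": "LinkedIn", "Message Type": "Transactional", "Sent": 157939},
]

def transpose(rows, pivot_field, value_field, key_field):
    collapsed = {}
    for r in rows:
        # One output row per key; each pivot value becomes a column.
        out = collapsed.setdefault(r[key_field], {key_field: r[key_field]})
        out[r[pivot_field]] = r[value_field]
    return list(collapsed.values())

pivoted = transpose(rows, "Message Type", "Sent", "Customer")
```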

REVERSE_TRANSPOSE

Opposite of Transpose, folds columns into rows

Syntax:

REVERSE_TRANSPOSE(<New ID column>, <New Value column>, <Value column 1>, ...., <Value column N>)

Example: http://recordit.co/8HRx7aJtmB

In this example, the initial data has 5 columns. After executing select reverse_transpose(NEW_ID, NEW_V, V1, V2, V3), the value columns are folded into the new columns: NEW_ID contains the old value column names (V1, V2, V3, one for each new row) and NEW_V contains the corresponding value.

When you have many columns to fold in, specify the columns you want to pin by at the front, followed by the new ID column and the new value column, followed by a *.

Syntax:

REVERSE_TRANSPOSE(Customer, Campaign, State, <New ID column>, <New Value column>, *)

This will fold in all the columns except for the columns in the first section.
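
The folding can be sketched in Python as follows. The reverse_transpose helper and the single sample row are hypothetical; columns not listed as value columns are treated as pinned and copied onto every output row.

```python
def reverse_transpose(rows, id_col, value_col, value_fields):
    out = []
    for r in rows:
        # Everything that is not folded stays pinned on each new row.
        pinned = {k: v for k, v in r.items() if k not in value_fields}
        for field in value_fields:
            out.append({**pinned, id_col: field, value_col: r[field]})
    return out

rows = [{"Customer": "Facebook", "V1": 1, "V2": 2, "V3": 3}]
folded = reverse_transpose(rows, "NEW_ID", "NEW_V", ["V1", "V2", "V3"])
```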

INJECT

Injects last value records in for a date range when the values are not present for that date.

For example, if a sensor emits data point 100 for 01/01/2016 and the next change of value is 200, 10 days later, you can use the inject function to inject 100 into all dates in between that range.

INJECT(<Date Field>, <Start Date for Injecting>, <End Date for Injecting>, <Injection Frequency> [, <Select Fields>])
[group by <Dimension 1>[, ..., <Dimension N>]]

The optional <Select Fields> can either be * (for all fields) or a comma-separated list of selected fields from the input data.

select inject(date, start_range_date, end_range_date, 1d, Name, Division, Score)
group by Name, Division
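
The sensor scenario described above amounts to a daily forward fill. A minimal Python sketch, using a hypothetical inject_daily helper and a 1d frequency, could look like this. It illustrates the idea only and does not cover grouping or selected fields.

```python
from datetime import date, timedelta

def inject_daily(readings, start, end):
    """readings maps date -> value; the last seen value is carried forward."""
    filled, last = {}, None
    day = start
    while day <= end:
        last = readings.get(day, last)  # keep the previous value on missing days
        filled[day] = last
        day += timedelta(days=1)
    return filled

readings = {date(2016, 1, 1): 100, date(2016, 1, 11): 200}
filled = inject_daily(readings, date(2016, 1, 1), date(2016, 1, 11))
```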

APPEND

Combines the results of two queries into the same dataset

(select sum(sent) as Facebook Sent where customer=Facebook) append (select sum(sent) as LinkedIn Sent where customer=Linkedin)

Cloud9QL Query:

(select sum(sent) as Facebook Sent where customer=Facebook) 
      append 
  (select sum(sent) as LinkedIn Sent where customer=Linkedin)

Nested

Use the dot notation to query nested elements and the array notation for selecting items in an array. The example below uses a JSON nested string and uses Cloud9QL to parse it.

[
  {
    "Sent": 239232,
    "Message Type": "Transactional",
    "Customer": "Facebook",
    "Week": "4/7/14",
    "Date": "4/8/14 17:15:00",
    "Opened": 56248,
    "nestedObj": {
      "a": 1
    },
    "nestedArr": [50, 100]
  },
  {
    "Sent": 1151481,
    "Message Type": "Marketing",
    "Customer": "Target",
    "Week": "4/7/14",
    "Date": "4/8/14 21:35:00",
    "Opened": 253532,
    "nestedObj": {
      "a": 2,
      "secondLevel": {
        "x": 100,
        "y": [1000, 3000]
      }
    },
    "nestedArr": [150, 100]
  }
]

Cloud9QL Query:

select nestedObj.a, nestedArr[0], nestedObj.secondLevel.x as Second Level Object,nestedObj.secondLevel.y[1] as Second Level Array, sent 
where nestedArr[0]=150
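
For comparison, the same lookups written against plain Python dictionaries are shown below. The records are trimmed to the fields the query touches, and the output key names are chosen for this sketch only.

```python
records = [
    {"Sent": 239232, "nestedObj": {"a": 1}, "nestedArr": [50, 100]},
    {
        "Sent": 1151481,
        "nestedObj": {"a": 2, "secondLevel": {"x": 100, "y": [1000, 3000]}},
        "nestedArr": [150, 100],
    },
]

selected = [
    {
        "nestedObj.a": r["nestedObj"]["a"],                             # dot notation
        "nestedArr[0]": r["nestedArr"][0],                              # array notation
        "Second Level Object": r["nestedObj"]["secondLevel"]["x"],
        "Second Level Array": r["nestedObj"]["secondLevel"]["y"][1],
        "sent": r["Sent"],
    }
    for r in records
    if r["nestedArr"][0] == 150  # the where clause
]
```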

To unwind/expand an array, use the expand syntax.

Example:

select customer, nestedObj.secondLevel.y as Nested;
select expand(Nested);

Note that expand must be specified on its own, without any other elements within the select.

Chaining Statements

Multiple statements can be chained one after the other using a semi-colon delimiter, where the results of the first statement is passed in to the second and so on.

Example:

select sum(sent) as Sent, sum(opened) as opened, customer, month(date) as Month group by customer, month(date);
select (opened/sent)*100 as Open Rate, Opened, Sent, Customer, Month;
select round(Open Rate,1) as Open Rate, Opened, Sent, Customer, Month where Month > $c9_thisyear+2m and Open Rate > 20 order by Open Rate asc

This:

a) gets the total Sent and Opened on a monthly basis for each customer,

b) then calculates the open rate based on the data from the previous step,

c) then rounds the open rate and keeps only rows from March onwards with an open rate above 20, ordered by open rate.

IP to Geo-Location

IP to GEO function enables city level geo location from IP addresses.

Note: This uses GeoLite2 data created by MaxMind.

Cloud9QL Query:

select ip_to_geo(IP), Some Field

This:

Queries the MaxMind database to determine location fields.
The fields are added as separate columns to the result.

Geocoding - Lat/Long from Address

Retrieves lat/longs from addresses using geocod.io.

Note: This requires your own API key from geocod.io.

Cloud9QL Query:

select concat(address,',',city,',',state) as address;
select geocode(address,626bb3d8cd2c3db255d2223c67f670525d7a5d5)

Example:

select geocode(<fullAddress>,<apiKey>)

This:

Issues a batch geocoding request to geocod.io.
The fields are added as separate columns to the result.

Forecasting/Predictions

Cloud9QL supports the ability to forecast/predict future data points based on previous data history for any dataset.

To predict 1 point in the future:

select PREDICT(<field to predict>, <date field>, <start date for prediction>[, <prediction model>])

To predict multiple data points in the future:

select PREDICT(<Field to Predict>, <Date Field>, <Start Date for Prediction>,
    <Prediction Frequency>, <Prediction Data Points>[, <prediction model>])

To predict multiple data points in the future based on secondary dimension(s) (ie: grouping(s)):

select PREDICT(<Field to Predict>, <Date Field>, <Start Date for Prediction>,
    <Prediction Frequency>, <Prediction Data Points>[, <prediction model>])
group by <Dimension 1>, ..., <Dimension N>

This:

Loads the data points from input data.
Determines the optimum prediction model.
Predicts future data point(s).
<prediction frequency> is in the format <Number><TimeUnit>. For example, 1d means daily and 2m means every 2 months.
<prediction model> optionally selects a specific model from the following supported models (case-sensitive):
- Regression
- PolynomialRegression
- MultipleLinearRegression

Example:

Cloud9QL Query:

select predict(Count, Date, 07/10/2015, 1d, 5)

Cohort

Cohorts are useful to determine a grouping of data across a date dimension. For example, how many users remain active n months after initial sign up, etc.

We currently support 2 types of input data.

select COHORT(<Date Column>, <Cohort Date Definition>, <Cohort Period Definition>), <Cohort Operation>
group by <Cohort Date>, <Cohort Period>

Note:

Input data needs to be sorted by Date in ascending order.

Cohort Period returns a number (ie: the period) or a date. Example:

a. 1m: Cohort Period as number

b. (1m): Cohort Period as Date

Example 1: If we already have the cohort date populated

Cloud9QL Query:

  select 
      cohort(
          Transaction Date, 
          Register Date as Cohort Date, 
          1m as Cohort Period),
      sum(Amount) as Total Amount
  group by Cohort Date, Cohort Period

Example 2: If we only have transactional events like the following example:

Cloud9QL Query:

  select 
      cohort(
          Transaction Date, 
          Transaction Date as Cohort Date where Event Type = Sign Up group by User ID, 
          1m as Cohort Period),
      sum(Amount) as Total Amount
  where Event Type = Purchase
  group by Cohort Date, Cohort Period

Example 3: Cohorts can be used in combination with transpose to flatten the result based on date

Cloud9QL Query:

  select 
      cohort(
          Transaction Date, 
          Transaction Date as Cohort Date where Event Type = Sign Up group by User ID, 
          1m as Cohort Period),
      sum(Amount) as Total Amount
  where Event Type = Purchase
  group by Cohort Date, Cohort Period;
  select transpose(Cohort Period, Total Amount, Cohort Date);

Example 4: A common cohort is retention in percentage format which can be computed as follows:

Cloud9QL Query:

  select 
      cohort(
          Transaction Date,
          Transaction Date as Cohort Date where Event Type = Sign Up group by User ID,
          1m as Cohort Period,
          Cohort Count),
      count(distinct(User ID)) as Retention
  where Event Type = Purchase and Cohort Date is not null
  group by Cohort Date, Cohort Period;
  select Cohort Date, Cohort Period, Cohort Count, 
      Retention * 100 / Cohort Count as Retention Percent;

Expand

Unwinds nested field values: This function will expand array, map, or array of maps data structure into rows.

select EXPAND(<Column with Nested Value>)

Example: In this example, the Name field's value is an array of maps, each with a Last Name and a First Name.

Cloud9QL Query:

    select expand(Name)

Note that this function must be used in isolation, i.e., cannot be used in combination with others. Use query chaining to manipulate the results:

select expand(Name);
select * where First Name like John

EXPAND_ARRAYS

Unwinds multiple nested field values: This function can expand multiple arrays, maps, or arrays of maps data structures into rows.

select EXPAND_ARRAYS(<Column with Nested Value 1>, ..., <Column with Nested Value N>)

Example: In this example, there are two nested objects Grade and Address. Grade field's value is an array of map of three fields date, grade, and score. Address field's value is a map of four fields building, coord, street, zipcode.

Note that this function must be used in isolation, i.e., cannot be used in combination with others. Use query chaining to manipulate the results:

select nestedObj1 as Nested1, nestedObj2.secondLevel.y as Nested2;
select expand_arrays(Nested1, Nested2);

EXPAND_ARRAYS_WITH_DEFAULTS

A more powerful version of EXPAND_ARRAYS. This function can expand multiple arrays, maps, or arrays of maps data structures into rows. It also allows you to fill in blank fields of expanded arrays with a default value like nulls or a chosen value if the arrays are different in size.

EXPAND_ARRAYS_WITH_DEFAULTS(<field 1>, 0, <field 2>, now(), <field 3>, LAST, <field 4>, <value from another field on the same row>, ..., <field N>, NULL)

Example: In this example, there are two nested objects Grade and Address. Grade field's value is an array of map of three fields date, grade, and score. Address field's value is a map of four fields building, coord, street, zipcode.

When you unwind grade and coord with EXPAND_ARRAYS(), you find that grade has more rows than coord.

With this function, you can choose a default value to fill in the blank spaces. This value will follow that object in the function. Below, we chose to fill in the blank spaces for the coord field with 0.

Note that a default value is required for all fields. If there is no particular default value you wish to add, simply enter null after the field as seen below for the field grade.

select grade, address.coord as coord;
select expand_arrays_with_defaults(grade, null, coord, 0);
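
The padding behavior can be sketched in Python. The expand_with_defaults helper and the sample values are hypothetical, and the arrays hold plain values here rather than maps to keep the sketch short.

```python
def expand_with_defaults(row, defaults):
    """defaults maps field -> fill value used past the end of that field's array."""
    length = max(len(row[f]) for f in defaults)
    return [
        {f: row[f][i] if i < len(row[f]) else fill for f, fill in defaults.items()}
        for i in range(length)
    ]

row = {"grade": ["A", "B", "A"], "coord": [-73.85, 40.84]}

# grade gets null (None) as its default, coord gets 0.
expanded = expand_with_defaults(row, {"grade": None, "coord": 0})
```

Because grade is the longer array, only coord is padded, and the third row receives the default 0.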

To unwind/expand multiple arrays and fill in default values, use the expand_arrays_with_defaults syntax.

Example:

select nestedObj1 as Nested1, nestedObj2.secondLevel.y as Nested2;
select expand_arrays_with_defaults(Nested1,null, Nested2,0);

Note that expand_arrays_with_defaults must be specified on its own, without any other elements within the select.

Parse

If your data is a JSON string, the PARSE function can be used to convert it to an object, which can then be further manipulated and processed.

select PARSE(<JSON String Column>)

Example: In this example, the Profile, Versions, and description fields each contain a JSON string.

Cloud9QL Query:

select parse(Profile) as Profile, 
  parse(Versions) as Versions, 
  parse(description) as Description;
select Profile.name, Description

Encryption/Obfuscation

In case your data needs to be encrypted/obfuscated for storing, you can use the ENCRYPT and DECRYPT functions to achieve this goal.

Basic Usage (Default Key)

The simplest form uses Knowi's default encryption key:

select ENCRYPT(<Column>)
select DECRYPT(<Column>)

Example:

select ENCRYPT(sensitive_data) as encrypted_value
select DECRYPT(encrypted_column) as original_value

Custom AES-128 Key Encryption

For enhanced security, you can provide your own AES-128 encryption key:

select ENCRYPT(<Column>, <AES-128 Key>)
select DECRYPT(<Column>, <AES-128 Key>)

Where <AES-128 Key> must be a Base64-encoded AES-128 key (exactly 16 bytes when decoded).

Example:

select ENCRYPT(ssn, 'OmYXssauKqtL3r8vnE5GAg==') as encrypted_ssn
select DECRYPT(encrypted_ssn, 'OmYXssauKqtL3r8vnE5GAg==') as ssn

Important Notes

Custom keys must be Base64-encoded AES-128 keys (exactly 16 bytes/128 bits when decoded)
Encryption uses AES with either CBC or GCM mode
The same key must be used for both encryption and decryption
Encrypted values are returned as Base64-encoded strings
Invalid key formats will result in an error
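
Before passing a custom key, it can help to confirm that it decodes to exactly 16 bytes. The following Python check is a sketch. The helper name is made up, and it validates only the key format described above, not how Knowi itself performs the check.

```python
import base64

def is_valid_aes128_key(key_b64):
    try:
        # validate=True rejects characters outside the Base64 alphabet.
        raw = base64.b64decode(key_b64, validate=True)
    except ValueError:  # binascii.Error is a subclass of ValueError
        return False
    return len(raw) == 16  # 128 bits

example_key_ok = is_valid_aes128_key("OmYXssauKqtL3r8vnE5GAg==")
bad_key_ok = is_valid_aes128_key("not-a-key")
```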

Window Functions

Window functions let you compute values across a set of related rows while keeping each row in the result. Use them for things like per-group averages or totals, without collapsing rows with a GROUP BY.

SELECT
column_name1,
window_function(column_name2) OVER (PARTITION BY column_name3) AS new_column
FROM table_name;

window_function is an aggregate such as SUM, AVG, COUNT, MIN, MAX
PARTITION BY groups rows for the window
Omit PARTITION BY to treat the entire result as one window

Example:

Add a new column to your dataset that calculates an aggregate value from another column. For example, create a column called Store_Total that contains the sum of all values in the Purchase Total column.

Select *, SUM(Purchase Total) OVER() as Store_Total;

Add a new column that aggregates data from another column, but groups the calculation by a categorical column.

Select *, SUM(Purchase Total) OVER(PARTITION BY Department) as Department_Total;
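
The two queries above can be mimicked in Python to show that every input row is kept. The rows are hypothetical, and the sketch covers only SUM.

```python
from collections import defaultdict

rows = [
    {"Department": "Toys", "Purchase Total": 20},
    {"Department": "Toys", "Purchase Total": 30},
    {"Department": "Books", "Purchase Total": 15},
]

# One running total per partition (Department).
totals = defaultdict(int)
for r in rows:
    totals[r["Department"]] += r["Purchase Total"]

store_total = sum(totals.values())  # OVER() with no partition

# Every original row survives; the aggregates are added as new columns.
with_totals = [
    {**r, "Department_Total": totals[r["Department"]], "Store_Total": store_total}
    for r in rows
]
```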

The current implementation supports the OVER clause with an optional PARTITION BY, enabling the use of aggregate functions within the window, including SUM, AVG, COUNT, MIN, and MAX. At present, ORDER BY within OVER is not supported, and ranking functions such as ROW_NUMBER, RANK, and DENSE_RANK are not yet available.