
Operations General Information

Operations enable you to combine, filter, and transform your various sources of data into the new datasets you need for analysis.

Operations are applied in the Workbench of a Project and can only be applied to Datasets created in that Workbench. All operations already applied to a dataset are displayed in a stack in the Operation Toolbox, with the most recent on top.

Manage Operations in the Recipe

Add to recipe: Add a new operation to your active dataset by clicking "+ Add to Recipe" in the Operation Toolbox and selecting the operation. You can also add an operation to any dataset in your flow by right-clicking it and selecting the "Add to Recipe" option from the context menu.

Delete an operation from the recipe: Delete an operation from the stack by hovering over it and clicking the "Trash" button that appears to its right. Because each operation depends on the output of the operations below it, only the top operation can be deleted. To delete an operation further down the stack, first delete the operations above it.

Modify an applied operation: Modify an operation by clicking on it in the stack and changing the existing values. Note that changes will fail if they would cause errors in operations higher up the stack.
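The top-only deletion rule can be pictured with a small model. This is a minimal sketch, not Datameer's API: the `Recipe` class and its method names are hypothetical and exist only to illustrate why an operation further down the stack cannot be removed on its own.

```python
from dataclasses import dataclass, field


@dataclass
class Recipe:
    """Hypothetical model of an operation stack (illustration only)."""

    operations: list[str] = field(default_factory=list)

    def add(self, operation: str) -> None:
        # A new operation always lands on top of the stack.
        self.operations.append(operation)

    def delete(self, index: int) -> str:
        # Each operation consumes the output of the one below it,
        # so only the top (most recent) operation may be removed.
        if not self.operations:
            raise IndexError("recipe is empty")
        top = len(self.operations) - 1
        if index != top:
            raise ValueError(
                f"delete the {top - index} operation(s) above index {index} first"
            )
        return self.operations.pop()

    def stack_view(self) -> list[str]:
        # Most recent operation first, mirroring the toolbox display.
        return list(reversed(self.operations))
```

In this model, removing an operation that sits two levels down means calling `delete` on the top operation twice before the target itself becomes the top.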

Supported Column Names

A column name in Datameer may contain any character except the backtick (`). Column names within the same dataset must be unique and are case-insensitive (e.g. "State" and "STATE" cannot both be used in the same dataset).
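If you prepare column names outside Datameer, a pre-check along these lines can catch both problems before loading. It is a sketch under assumptions: `validate_column_names` is a hypothetical helper, and comparing names with `str.casefold()` is a guess at how case-insensitivity is applied, not documented Datameer behavior.

```python
def validate_column_names(names: list[str]) -> list[str]:
    """Return a list of problems; an empty list means the names pass."""
    problems: list[str] = []
    seen: dict[str, str] = {}  # case-folded name -> first spelling seen

    for name in names:
        # Rule 1: the backtick is the only disallowed character.
        if "`" in name:
            problems.append(f"{name!r} contains a backtick")

        # Rule 2: names must be unique when case is ignored.
        # Assumption: casefold() approximates the comparison used.
        key = name.casefold()
        if key in seen:
            problems.append(f"{name!r} clashes with {seen[key]!r}")
        else:
            seen[key] = name

    return problems
```

Calling `validate_column_names(["State", "STATE"])` reports one clash, while a list of distinct, backtick-free names returns an empty list.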

Dataset Size Limits

Datameer has a default maximum dataset size of 3 million records. The limit applies both when adding new tables and when a dataset grows as a result of an operation. Datasets above this limit are flagged with a yellow warning triangle on search cards and on their detail pages.

If a dataset grows beyond the record count limit, it will stop feeding data to any downstream datasets that have been built from it in Datameer.
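The effect of the limit on downstream datasets can be sketched as follows. Everything here is illustrative: the function names and the `built_from` mapping are hypothetical, and treating a dataset with exactly 3 million records as still within the limit is an assumption based on the wording "above this limit".

```python
DEFAULT_MAX_RECORDS = 3_000_000  # default limit stated in the text


def is_over_limit(record_count: int, limit: int = DEFAULT_MAX_RECORDS) -> bool:
    # Assumption: the limit is inclusive, so only counts strictly
    # greater than the limit are flagged.
    return record_count > limit


def stalled_feeds(
    record_counts: dict[str, int],
    built_from: dict[str, str],
    limit: int = DEFAULT_MAX_RECORDS,
) -> list[tuple[str, str]]:
    """List (source, downstream) pairs whose source exceeds the limit.

    built_from maps each downstream dataset to the dataset it was
    built from (hypothetical structure, for illustration only).
    """
    return [
        (source, downstream)
        for downstream, source in sorted(built_from.items())
        if is_over_limit(record_counts[source], limit)
    ]
```

For example, with `record_counts = {"orders": 3_500_000, "customers": 10_000}` and `built_from = {"orders_by_region": "orders", "vip_customers": "customers"}`, only the feed from `orders` to `orders_by_region` is reported as stalled.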