Datameer Changelog#
This page lists all new features and enhancements introduced in recent and previous Datameer releases.
November 2023#
New Features | |
---|---|
Jobs | When creating an unload job, the output path can be dynamically configured based on an expression. The wizard offers a dropdown menu for partitioning output data. |
'Unload Job' supports exporting deployments as CSV to cloud storage. | |
Monitoring | The 'Monitoring Dashboard' presents a collection of basic metrics for tables. You can access this information from the 'MONITORING' tab on the Datameer Home page. Tables can be added or removed with a click in the dataset browser. Data about freshness, inserts, deletes, and updates are collected on an hourly basis. |
The thresholds for the table checks can be configured in the data quality inspector. | |
Improvements | |
Jobs | Scheduled job runs are displayed in the job timeline on the 'Jobs Dashboard'. |
In the jobs definition inspector a section was added to display audit details. | |
In the environment inspector a section was added to display audit details. | |
Job runs can be filtered for status and for execution within the last 24 hours. | |
The environments list can be sorted by name, username, role, warehouse. | |
The job definitions list can be sorted by type, name, environment, trigger and last modified date. | |
Budget Controls | The 'Usage Dashboard' displays a detailed graph with several filters. A bar chart is shown when the date-time cardinality is below 50. |
Exploration | Overflow exploration tabs are organized in a drop-down menu. |
Transformation | Union: Unmapped columns can be added from a source to the output columns via drop-down menu. |
Union: Output columns can be reordered by drag and drop. | |
UI | The 'Jobs Dashboard' appearance has been optimized for small resolution screens. |
For improved visibility on the 'Jobs Dashboard' each job displays a status icon. |
October 2023#
Topic | What Changed? |
---|---|
Jobs | The 'Jobs Dashboard' has been redesigned to make managing your Jobs more efficient. It offers various statistics to view Jobs from different angles, such as numerical insights, a top-three overview, and the most recent three Job results. |
The job run history is now accessible through a Job's Inspector, providing detailed information about Job runs and error messages. | |
For improved usability, dashboard filters have been relocated to the left side, allowing users to select different filtering options based on the period, Jobs, environment, and more. | |
If an error occurs during environment setup, it is no longer displayed in the snackbar; instead, it is directly shown in the environment setup dialog. | |
You can now delete a Job definition and an Environment directly from the Job Inspector. | |
The wording and execution trigger configuration for triggering a Job from the Job Inspector have been enhanced. | |
Project entries with no valid outputs in the table selection tree within the Job Definition dialog are now filtered out. | |
Budget Controls | The Warehouse Usage report provides information on how a Warehouse is utilized. You can access this information from the 'USAGE' tab on the Datameer Home page. It displays the total spending for each Warehouse and visualizes spending trends by month. You can explore further details by clicking on a specific Warehouse in the table and view metadata information in the Inspector. The Warehouse overview table includes the name, costs, and trend for each Warehouse. To delve deeper into your costs, you can click the 'Warehouse Usage' link button, allowing you to filter by timestamp, Warehouse, and user. You can also aggregate Warehouse data with different measures and date bins. |
Administrators can now configure a Usage Table Creation job on a separate subpage as part of the 'USAGE' page. This enables administrators to control access to the dashboards and schedule data computation based on allocated budgets. | |
Transformation | Union: In each Union transformation, you can now delete columns using the 'x' icon, which triggers a confirmation pop-up. Removing the Output column also removes all associated input source mappings. |
Union: When configuring a union, you can click on the type of each Output column and select a new column type. Note that changing the column type may result in coercion errors in the preview. | |
Exploration | In explorations, a quick filter pill is now displayed when defining a numeric dimension, and the same applies when defining a date/datetime dimension. |
UI | Indicator icons have been enhanced to improve their visibility and understanding. |
September 2023#
Topic | What Changed? |
---|---|
Datameer Home Browser | The updated Data Browser on the Datameer Home page offers various filtering options, including 'Created by Datameer,' 'In Use,' and 'Scheduled.' |
Transformations | Aggregation - When utilizing a STRING column as a 'Measure' in the Aggregation Editor, you will now find 'MIN' and 'MAX' functions listed. |
Jobs&Environments | Datameer's Job feature empowers authorized users to create isolated environments for task execution. You can manage multiple environments for different jobs, accessible through the Datameer Home browser. Each job/environment includes detailed settings accessible in the associated Inspector. |
Exploration | A tree-map visualization is now available when a single dimension and one or two metrics are present. This visualization facilitates comparing usage entities such as warehouse, user, or role with associated metrics, making it easier to identify entities requiring attention or to compare and contrast entities. |
When exploring data, you can now use a new quick filter option to choose a subset of a STRING dimension column's values. | |
Additionally, exploration filters like 'does not start with,' 'does not end with,' and 'does not contain' are now available. | |
When filtering BOOLEAN columns in the 'Explore' dialog, the operator list is refined to 'Is True,' 'Is False,' 'Is Null,' and 'Is not Null.' | |
Usage Report | When triggering a Usage Report as an Admin, you can now select the Project in the configuration dialog. If no other Project is selected, the current Project is used by default. |
A JSON-formatted Query Tag is introduced for each query executed by Datameer on behalf of the user, providing query details. | |
The Usage Report now includes two predefined exploration nodes for common schema usage questions: the most queried objects within the last 30 days and objects that haven't been queried in the last 30 days, ordered by last activity. | |
Budget Controls | Admins and central IT teams needing to report budget usage to business owners now have a warehouse entry in the Data Grid showing year-to-date spending and trends, indicating whether spending is increasing, decreasing, or remaining stable. |
UI Improvements | The Data Preview buttons are rearranged for improved usability. |
Icons for deleted Snowflake tables in a Project Input/Output table are standardized, including a new 'Unavailable Table' icon for deleted Snowflake tables and sources. | |
The UI for each node type Inspector has been enhanced for consistency. |
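
The negated exploration filters listed above ('does not start with,' 'does not end with,' and 'does not contain') can be pictured with a small Python sketch. This is an illustration only, not Datameer's implementation: the operator labels come from the changelog, while the function `apply_filter`, the sample warehouse names, and the choice to drop NULL values are assumptions.

```python
from typing import Callable, Optional

# Operator labels are taken from the changelog; their exact semantics are assumed.
NEGATED_FILTERS: dict[str, Callable[[str, str], bool]] = {
    "does not start with": lambda value, term: not value.startswith(term),
    "does not end with": lambda value, term: not value.endswith(term),
    "does not contain": lambda value, term: term not in value,
}


def apply_filter(values: list[Optional[str]], operator: str, term: str) -> list[str]:
    """Keep the values that satisfy the negated string operator.

    None (NULL) values are dropped; this mirrors common SQL behaviour
    and is an assumption, not documented Datameer behaviour.
    """
    check = NEGATED_FILTERS[operator]
    return [v for v in values if v is not None and check(v, term)]


# Hypothetical sample data.
warehouses = ["PROD_WH", "DEV_WH", None, "PROD_ETL"]
kept = apply_filter(warehouses, "does not start with", "PROD")
print(kept)
```
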
August 2023#
Topic | What Changed? |
---|---|
Usage Reporting | The new Usage Reporting feature enables you to generate datasets and reports for objects stored in Snowflake (such as tables and views) and track warehouse usage. Previously, you could only create Schema Usage Reports from the Data Browser. Now, you have access to predefined usage reports like 'Most queried in the last 30 days' and 'Not queried in the last 30 days'. |
Table Materialization | You can now initiate table materialization directly from the 'Deployment' tab of a deployed table node. You can choose both the role and warehouse, and you will receive feedback indicating success or error. Additionally, you have the option to cancel the materialization at any time. If the deployment is out of sync, the button will be disabled, and you must redeploy the node. It's important to note that ad-hoc materializations will use the currently deployed version and won't affect the deployment configuration. |
External API Materialization | You can now trigger ad-hoc table materialization from an external orchestration application using an execution trigger. You can set up this trigger in the deployment node's Inspector, where you can specify a callback URL to receive an email with deployment data. |
Data-driven Table Materialization | With the new data-driven table materialization feature, you can associate materializations across different projects, addressing dependencies more effectively. |
Deployment | Restoring a version from the Deployment History will no longer restore former scheduling settings. |
Browsing Data | When you collapse a schema in the Data Browser, the parent schema will now be highlighted if it contains the selected dataset. Similarly, when collapsing a database, the parent database will be highlighted if it contains the selected dataset. |
Additionally, switching between the PROJECT and DATASETS tabs will retain the highlighting of the latest selected dataset or schema and collapse the browser as it was before tab switching. | |
You can also search for datasets in the Data Browser and filter the search results by categories like 'All Datasets,' 'Created by Datameer,' 'In Use,' and 'Scheduled.' Please note that you need to enter a search term first and then select the filter. | |
The Dataset and Project Browser retains its familiar features, including previewing, accessing column and profile information (including calculating column metrics), and utilizing the 'Explore' feature to add datasets to a Project. | |
Workbench | AI support: A new auto-summarization button is now available for transformations, allowing you to automatically update a node's description in the Inspector. |
Grouping Data: We have made visual improvements to the grouping input/output in Datameer. This includes changes to column headers and enhancements in how node relations are displayed. | |
Transformation | AI support: When using the OpenAI feature in Datameer, you can now browse the Data Browser directly within the Notebook. |
Exploring Data | The new 'Explore' menu provides options to rename any exploration, delete or refresh it, and make edits as needed. |
Adding Data | The heading text for the File Upload feature has been updated. |
July 2023#
Topic | What Changed? |
---|---|
Transforming Data | OpenAI Support: You can now use natural language to query, explore and transform your datasets. The 'Notebook' feature allows you to smoothly incorporate SQL into your data transformation process. One significant advantage is that you can save a considerable amount of time by avoiding the need to write intricate SQL commands. |
The schema can be viewed and browsed via the Data Browser to help you answer your questions. | |
Adding Data | Uploading Data: The 'File Upload' header setting text has been improved. |
Cost Control | Usage Report: You now have the opportunity to examine the frequency of usage for each view or table within your schemas. |
Datameer Home | Browsing Data: You can now browse all datasets and Projects from the Datameer Home page. |
The 'Datasets' tab offers a comprehensive view of all available datasets. Easily browse through your datasets to locate the relevant ones for your analysis. Get a quick preview of the dataset contents and explore its column information. This allows you to understand the dataset's structure without the need to open it fully. The 'Datasets' tab provides a convenient option to add data directly to an existing Project or create a new Project for your dataset. Delve into the metadata of each dataset, empowering you to gather valuable insights about the data sources you are working with. | |
The 'Projects' tab now allows you to easily view and search for Projects in your Snowflake environment. You can clone existing Projects to create duplicates with ease or delete Projects that are no longer required. You can now dive directly into any Project to explore its contents effortlessly. Gain insights into your Projects with access to their metadata. Understand crucial information about each Project for better organization and decision-making. | |
As you switch between the 'Projects' and 'Datasets' tabs in the Browser, the asset you had previously selected will be shown, preserving your last selection. | |
Grouping Data | To gain a clearer understanding of the content within a group, you can examine the Input and Output associated with it. When you click on a node's name in the Inputs/ Outputs table, the view will automatically switch to that specific node. |
Exploration | In the Explore dialog's filters section, we have introduced a new AND/OR selector, enabling you to define your filter conditions precisely. By default, the selector is set to "AND", which preserves the existing behavior. |
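
To illustrate the AND/OR selector from the Exploration entry above, here is a minimal Python sketch of how several filter conditions could be combined. The helper `combine`, the predicates, and the sample rows are hypothetical; only the two modes and the "AND" default come from the changelog.

```python
from typing import Callable, Iterable

Row = dict
Predicate = Callable[[Row], bool]


def combine(predicates: Iterable[Predicate], mode: str = "AND") -> Predicate:
    """Combine filter conditions; "AND" is the default, as in the changelog."""
    preds = list(predicates)
    if mode == "AND":
        return lambda row: all(p(row) for p in preds)
    if mode == "OR":
        return lambda row: any(p(row) for p in preds)
    raise ValueError(f"unknown mode: {mode}")


# Hypothetical sample rows and conditions.
rows = [
    {"region": "EU", "revenue": 120},
    {"region": "US", "revenue": 80},
    {"region": "EU", "revenue": 40},
]
is_eu = lambda r: r["region"] == "EU"
is_large = lambda r: r["revenue"] >= 100

and_rows = [r for r in rows if combine([is_eu, is_large])(r)]
or_rows = [r for r in rows if combine([is_eu, is_large], "OR")(r)]
print(len(and_rows), len(or_rows))
```

With "AND", only rows matching every condition remain; with "OR", a row matching any one condition is kept.
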
June 2023#
Topic | What Changed? |
---|---|
Node Caching | Working with large datasets and complex Projects is now smoother with the implementation of data caching. It greatly speeds up the development process by reducing the chances of encountering empty previews and accelerating downstream calculations. |
Grouping | When copying a Project that contains groups, all group information is also copied. Expanded groups in the original Project are expanded in the new project as well. If a deployment is part of the group, the new group is created without the deployment node. If the original group consists only of deployments, the group will not be copied since it would be empty. |
Additionally, the context menu for groups has been improved. Double-clicking a group expands it, while double-clicking the group's name label of an expanded group collapses the group. | |
Project | Clicking on an input or output table within a node's name now switches the focus to that node. |
Usability | Preview Data: Previewing data now runs without a timeout and can be canceled by the user at any time. |
Deployment | The green icon indicating successful deployments has been removed from the status display. However, for failed deployments, the icon used now considers usability for colorblind individuals. |
Transformation | Formula Editor: In the Formula Editor, the button "Apply Formula" has been changed to "Preview". There is no longer a need to click "Preview" separately to apply the formula. |
Exploration | When creating compound filters in the "Exploration" dialog, you can now select between "AND" and "OR" for combining the filters. The default selection is "AND". |
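
The group-copy rules from the Grouping entry above can be sketched in Python as follows. This is a simplified model under stated assumptions: the `Node` and `Group` classes and the `copy_groups` function are hypothetical names, not part of Datameer.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    is_deployment: bool = False


@dataclass
class Group:
    name: str
    nodes: list[Node] = field(default_factory=list)
    expanded: bool = False


def copy_groups(groups: list[Group]) -> list[Group]:
    """Copy groups the way the changelog describes Project copies.

    Deployment nodes are left out of the copy, the expanded state is kept,
    and a group that would end up empty is not copied at all.
    """
    copies = []
    for group in groups:
        kept = [Node(n.name) for n in group.nodes if not n.is_deployment]
        if not kept:
            continue  # the group consisted only of deployments
        copies.append(Group(group.name, kept, group.expanded))
    return copies


# Hypothetical Project with one mixed group and one deployment-only group.
original = [
    Group("prep", [Node("clean"), Node("orders_table", is_deployment=True)], expanded=True),
    Group("outputs", [Node("report", is_deployment=True)]),
]
copied = copy_groups(original)
print([g.name for g in copied])
```
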
May 2023#
Topic | What Changed? |
---|---|
Exploring Data | The Metadata Inspector now provides enhanced metadata for your assets available in the same place you build your transformation pipelines. This empowers you to avoid switching between multiple tools or tabs when updating tags, documentation and custom properties. |
Transforming Data | Formula Editor: After entering a formula in the Formula Editor you can now hit the "Apply" button right away without generating the preview above the editor. |
Cloning Projects | When a Project is cloned, the groups and group information are also cloned. Expanded groups are expanded in the new Project as well. If an output is part of the group, the group is created but without the output node. If the original group only contains outputs, the group will not be copied. |
March & April 2023#
Topic | What Changed? |
---|---|
Slack Integration | You can now integrate your Slack account in order to get informed about team and pipeline activities with integrated notifications from Datameer. This enables you to share query results and debug issues without leaving Slack. |
Snowflake Connection | The Snowflake connection/ disconnection status is now more prominent. The login icon is now highlighted and a blue banner informs the user to first log in to Snowflake. |
Project | You can now view a Project's settings in the Inspector. To do so, click an empty space in the Flow Area and open the 'Settings' tab in the Inspector. |
Workbench | Source Node Inspector: The 'Info' tab of a source node in the Inspector now contains the fully qualified Snowflake name, which can be copied to the clipboard. |
Grouping: A group can now be renamed from the group's Inspector. | |
JSON Extract | When a schema determination takes too long, the user can now cancel the determination. |
To let you proceed with the JSON extract without losing time, the schema and attribute metrics are based on preview data. The full schema can be fetched by clicking the associated button. | |
To save time and money, the Workbench's sample data for an asset is reused when opening the 'Extract JSON' operation. | |
The navigation has been simplified: Clicking on a row in the field pane selects the associated element in the tree pane and vice-versa. | |
SQL Editor | Pressing 'mod-/' now toggles a comment. |
Deploying Data | The Global Deployment History now displays the table type information in addition to the type icon. |
UX Improvements | All date and time patterns are updated to be consistent across Datameer. |
February 2023#
Topic | What Changed? |
---|---|
Workbench | Flow Area: You can now organize your nodes in the Project pipeline using groups to simplify navigation and maintenance. Grouped nodes can be visually expanded and collapsed. Furthermore you can rename the group nodes to clarify the group's content. |
Inspector: The 'Last Run' field has been updated to show the time zone and success/ fail indicator. | |
Google Sheets | When a Google Sheet is already in use, an appropriate error is now shown for that state. |
If the Google Sheets service account has been removed, the actions for the Google Sheet nodes are adjusted: 'Sync Now', 'Modify' and 'Resume' show a pop-up depending on whether the user is an admin or not. Configured nodes can be paused and deleted. | |
When a user configures a Google Sheet and exports a Transformation to a Spreadsheet, the 'Datameer Origin' metadata field shows the Project name. | |
Deploying Data | The execution relevant role for the materialization procedure is now shown in the 'Execution History' of a Deployment. |
Sharing Data | Data Grid: The 'Share' pill is now 'Export' to provide all export options out of the Data Grid, e.g. export to Google Sheets. |
Exploring Data | The 'Explore' tab in the Data Grid now contains 'Convert to Transformation' as an option. The 'Explore' tab is renamed to 'Exploration' and shows the grabber in hover state. The 'Exploration' operation in the operation stack is now renamed to 'Query Builder'. The former 'EXPLORE' node is now named 'QUERY' in the Flow Area. All 'Explore' related entries in the Inspector are now named 'Query'. |
All actions for the 'Exploration' tab are now consolidated in one menu. |
January 2023#
Topic | What Changed? |
---|---|
Getting Started | You can now create a new Project immediately while still exploring the sample marketing project. Just click the vertical 'Create New Project' tab on the left. |
Admin | New icons for each 'Admin' subpage improve the visibility of the subpages, e.g. for 'Integrations'. |
Workbench | New prominent visual indicators now show the user which component is being supported for drag and drop, e.g. the 'Explore' tabs from the Data Grid or datasets from the Data Browser. |
When switching between the 'Workbench' tab, 'Scheduled Shares' and 'Project Details', heading back to the Workbench tab selects and highlights the last added node in the Flow Area. | |
When dragging and dropping an exploration from the Data Grid to the Flow Area, the new node now appears in the Flow Area and is highlighted. | |
Google Sheets Integration | When the Google integration is not yet configured properly, clicking on 'Send to Google Sheets' from the 'Share' menu now provides helpful advice to solve the problem. |
Transformation | JSON semi-structured data update: You can now visually inspect the schema of your semi-structured data and destructure complex values. |
Join - You can now select both join columns in the 'Join' configuration from the drop-down. | |
Sharing | The email sent for a scheduled dataset with zero records has been improved. It now names the dataset, its link and the Project. |
When a scheduled email exceeds the maximum file size and therefore can't be delivered, a note is displayed for that error state. | |
UI Improvements | The 'Refresh' and 'Edit' buttons for sharing and exploring are now consistent in the node menu to match a better user experience. A 'Refresh' button is now added to the Inspector. |
The wording in the 'Configure Scheduled Email' has been improved to clarify that the recipient must be an email address. |
December 2022#
Topic | What Changed? |
---|---|
Getting Started | A Datameer trial user can now choose to start exploring an example Marketing Project or create a new Project. Choosing the example Project leads to a detailed Marketing Project which guides the user through Datameer's features and invites them to explore the whole Project. Choosing the 'Create a new project' option opens the empty Project and supports the user with the 'Onboarding Checklist'. The checklist's wording has been improved as well. |
Transforming Data | Light Data Preparation: Union - After clicking the floating + button, the input field in the 'Add Dataset' card is now focused. |
Exploring Data | Clicking the + button next to the 'Explore' pill in the Data Grid now opens the exploration dialog immediately. When configuring, you can now create the query without selecting any filter, aggregate or sort. This will result in a 'select * from source limit N' where N is our current exploration limit. By default all columns will be shown. The Data Preview is limited to a maximum of 1,000 rows. The button states for 'Explore', 'Share', 'Download', and 'Refresh' including tooltips are updated. |
After exploring data from the 'Explore' tab, there is a capability to directly move an exploration node to the Flow Area via the 'Share' menu to progress working on it later on. When scheduling an exploration via Email, the associated exploration node is displayed in the Flow Area. You can do the following: edit the exploration, show the exploration details, deploy the exploration data to Snowflake and/ or delete the exploration. | |
Sharing Data | Google Sheet Integration: You can now share Datameer datasets from a Project to Google Sheets featuring maintaining the lineage as well as history of the sent datasets. Furthermore you can synchronize the shared datasets on a schedule. |
The configuration for sharing data to Google Sheets must be set or edited by an Admin on the 'Admin' page. The Google Sheet must be configured to be shared with Datameer and the shared Datameer dataset must be configured to be shared with the Google Sheet URL. | |
A user will be notified when an upstream schema change has happened for a schedule. The user will be advised to review the change and adjust the schedule accordingly. | |
When a share fails, the user will be informed about the sending error immediately. | |
UI and Wording Improvements | The wording for the 'Schedule Email' dialog has been updated. |
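
The default exploration query mentioned in the 'Exploring Data' entry above ('select * from source limit N') can be sketched as a small Python helper. The function name `default_exploration_query` is hypothetical, the actual value of N is not stated in the changelog, and the source name is assumed to be a trusted identifier.

```python
def default_exploration_query(source: str, limit: int) -> str:
    """Build the query used when no filter, aggregate or sort is selected.

    `limit` stands in for the exploration limit N; its real value is not
    given in the changelog, so callers must supply it.
    """
    if limit <= 0:
        raise ValueError("limit must be a positive integer")
    return f"select * from {source} limit {limit}"


# Hypothetical usage with an illustrative limit value.
query = default_exploration_query("source", 1000)
print(query)
```
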
November 2022#
Topic | What Changed? |
---|---|
Getting Started | The 'Getting Started' wording has been updated to help users explore all basic Datameer features when working with Datameer for the first time. |
Workbench | Data Grid: Columns that shall be shown in the Data Grid can now be selected easily from the dropdown in the 'Preview', 'Columns' and 'Exploration' tab. |
Transformations | Light Data Preparation: You can now append the content of a dataset to the content of one or more datasets to create a new combined dataset. Within the 'Union Configuration' dialog, you can easily add new datasets from the current Project and configure the union feature. For each input dataset Datameer shows how many unique columns out of the total are mapped to the output. Help texts and hover texts help you to use the 'UNION' function as expected. |
The Inspector also reflects information about applied unions, e.g. the combine mode used and a list of the datasets used. | |
Exploring Data | The 'Explore' configuration has been improved by renaming the dialog and providing help texts. |
If an Exploration query is out of sync, it is not possible to download the data until the query is refreshed, and the user gets help on how to refresh the out-of-sync exploration query. | |
Providing Data | You can now choose to either send an email just in time or schedule email sending. |
Scheduled emails are now displayed as associated email nodes in the Flow Area. A schedule can be modified, paused, resumed and deleted. | |
When an email fails to be sent, the sender is informed about the error in the snack bar. When a dataset with zero records has been sent in a scheduled mail, both the recipient and the sender are informed about the empty dataset via email. | |
Usability | You can now press the 'Enter' key to confirm and close a dialog that must be confirmed with 'Apply'. |
UI and Wording Improvements | We made some design and wording improvements in the Data Grid. |
Workbench: We provide hover texts for all Workbench tabs to describe all features when exploring Datameer. |
October 2022#
Topic | What Changed? |
---|---|
Getting Started | Onboarding Checklist: You are now guided through the main Datameer features by the onboarding checklist with the following steps: Create a Project, add data, transform data, share insights and connect to your Snowflake. The onboarding checklist can be hidden temporarily. |
Security | For the trial, you now need to set a password with at least 8 characters. |
Workbench | De-duplication: When column names are de-duplicated, e.g. during a REPLACE, SPLIT or EXTRACT, the new columns receive a suffix: data2022 -> data_2022_split1. A SPLIT on data2022_split1 yields data2022_split1_split1 and data2022_split1_split2. If there is already a data2022_1 column, data2022 -> data2022_2. |
Data Grid | Filtering on a VARIANT column is now supported. |
When exploring, you can now filter the results of a summarized exploration query. | |
When selecting a cell in an 'Explore' tab, the row header is now highlighted. | |
Transformation | JOIN: If no suggestion could be found, a message about that is now shown during the JOIN configuration. |
JSON Support | You can now enjoy the improved JSON details with proper hover labels for cover and insert buttons, a 'View' section and a 'Path' section in a new design. |
Downloading Data | You can download a dataset as a CSV from any 'Exploration' tab. |
Admin | Snowflake Settings: Clicking "Edit" next to the account name on the 'Snowflake Settings' page allows you to change your Snowflake configuration. |
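
The column naming examples in the De-duplication entry above suggest a suffix scheme that can be sketched in Python. This is a guess at the rule based only on the examples given (data2022_split1_split1, data2022_2); the function names `split_names` and `deduplicate` are hypothetical, and the actual Datameer logic may differ.

```python
def split_names(column: str, parts: int) -> list[str]:
    """Name the columns produced by a SPLIT: <column>_split1, <column>_split2, ..."""
    return [f"{column}_split{i}" for i in range(1, parts + 1)]


def deduplicate(name: str, existing: set[str]) -> str:
    """Return `name` if it is free, otherwise the first unused `<name>_<n>`.

    Assumption: numbering starts at 1 and skips suffixes already in use.
    """
    if name not in existing:
        return name
    n = 1
    while f"{name}_{n}" in existing:
        n += 1
    return f"{name}_{n}"


# Examples mirroring the changelog entry.
print(split_names("data2022_split1", 2))
print(deduplicate("data2022", {"data2022", "data2022_1"}))
```
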
September 2022#
Topic | What Changed? |
---|---|
Project | You can now delete a Project via the kebab menu on the Datameer Home page. |
Flow Area | The 'Remove all unused datasources' option can now be executed from the context menu of an unused node in the Flow Area. |
Data Grid | 'Explore' tab: All validation and exploration options (Filter, Summarize, Sort) are now organized in one 'Explore' tab. You can explore more than one dataset for all sources, intermediate datasets and deployed data in this tab. |
'Explore' tab: The data filter dialog has been extended to provide more specific operators to allow multiple comparisons. An operator drop-down provides multiple operators, e.g. 'is null' and 'is empty', and columns can now be used as a comparison value, e.g. WHERE. | |
'Explore' tab: You can now filter for 'IS NULL' and 'IS NOT NULL' in the filter dialog as well. | |
'Profile' tab: You can now show only the columns with computed metrics by checking the 'See Only Calculated' box to compute another column's metrics as well. | |
'Profile' tab: Calculated metrics can now be minimized and maximized. The minimized state provides the subset of the computed metrics. | |
'Profile' tab: The Top-K charts now show up to 64 terms, and the Histogram now attempts to show 32 buckets. A string column Top-K chart now shows up to 64 terms no matter what the actual cardinality of the column is. | |
'Preview' tab: The runtime duration is now shown while the latest data is being computed. You can cancel the running preview query and re-run the data preview to show the latest results. | |
Uploading Data | When uploading CSV data, the lines with errors are dropped in an associated '
Transformation | Light Data Prep Operations: When using the 'Filter' operation you can now select the column from an alphabetically ordered drop-down. Searching for a column also works when entering a partial value. |
Light Data Prep Operations: The filter operations 'is empty' and 'is not empty' are now available when using the 'Filter' operation. | |
New TRY_* functions are added to the 'Conversion' operations. | |
SQL Editor | New 'Windowing' functions are added to the SQL functions list. |
Deployment | Only the schemas that you have permission to deploy to are listed in the destination list in the 'Deploy to Snowflake' dialog. The dialog also shows the number of available schemas and a link to the docs in case you cannot find the schema you are looking for. |
Providing Data | A user can now schedule sending an email with a CSV dataset besides sending an email just in time with the results. |
You can now configure the maximum file size for sharing a CSV file via email. The default value is set to 25 MB. Admins can change the value via a custom property. | |
Deleting data | Deleting a Project now deletes deployed datasets from both Datameer and your Snowflake instance. When a deployed dataset is referenced in another Project, a list of the referencing Projects is shown with a warning that the reference will break. |
After deleting a deployed node from the Flow Area, the parent node is now automatically selected. | |
Navigation | You are now guided back to your previous page after clicking the browser's 'Back' button on the 'Settings' page. |
Performance | The handling of many wide and large datasets in one Project is now improved. |
UI Fixes and Improvements | Some UI fixes have been applied to both the 'Deployment History' and 'Execution History' dialogs. |
When having complex pipelines in the Flow Area, overlapping edges are now prevented. | |
When opening a dataset that is used by more than one Project, the dataset is highlighted in the Flow Area for better orientation. | |
Error Handling | When a node is in an error state, the associated error messages for a non-existent or renamed column are improved and consistent. |
When the file size limit is exceeded, a meaningful error message is now shown. |
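
The 'Profile' tab limits mentioned above (up to 64 Top-K terms, an attempt at 32 histogram buckets) can be illustrated with a short Python sketch. Only the two numbers come from the changelog; the functions `top_k` and `histogram`, the equal-width bucketing, and the sample data are assumptions and not how Datameer or Snowflake computes these metrics.

```python
from collections import Counter

TOP_K_TERMS = 64        # limit stated in the changelog
HISTOGRAM_BUCKETS = 32  # target bucket count stated in the changelog


def top_k(values: list[str], k: int = TOP_K_TERMS) -> list[tuple[str, int]]:
    """Return at most k (term, count) pairs, most frequent first."""
    return Counter(values).most_common(k)


def histogram(values: list[float], buckets: int = HISTOGRAM_BUCKETS) -> list[int]:
    """Count values in equal-width buckets (assumed bucketing strategy)."""
    if not values:
        return []
    lo, hi = min(values), max(values)
    if lo == hi:
        return [len(values)]  # a single distinct value cannot fill 32 buckets
    width = (hi - lo) / buckets
    counts = [0] * buckets
    for v in values:
        # Clamp so the maximum value falls into the last bucket.
        idx = min(int((v - lo) / width), buckets - 1)
        counts[idx] += 1
    return counts


# Hypothetical column with 100 distinct terms; only 64 are reported.
terms = [f"term{i}" for i in range(100)] + ["term0"] * 5
top = top_k(terms)
hist = histogram([float(i) for i in range(100)])
print(len(top), len(hist))
```
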
August 2022#
Topic | What Changed? |
---|---|
Project | When changing the Project owner, the user is now informed via a confirmation in the snackbar. |
Data Grid | The Data Grid design has been improved to present more information about the current dataset, e.g. a more prominent tab design and the option to enlarge this area. Also the actual tab in the Data Grid is being remembered when navigating between the nodes in the Flow Area. Once loaded, a dataset does not have to be reloaded again while switching between different nodes. A refresh button allows you to refresh the data manually. |
Transformation | The aggregate functions 'Any', 'approxCount', 'Avg', 'Count', 'Kurtosis', 'Max', 'Median', 'Min', 'Mode', 'Skew', 'StdDev', 'StdDevPop', 'Sum', 'VarPop' and 'Var' have been added. Numeric source columns are required for: 'Median', 'Stddev', 'Stddev_Pop', 'Var_Pop', 'Var_Samp', 'Kurtosis', 'Skew'. |
You can now edit a transformation directly from the context menu of the respective node in the Flow Area. An icon next to the operation in the Inspector’s operation stack indicates that the transformation can also be edited there by clicking it. | |
Schema changes are now propagated downstream. This means columns are renamed and updated automatically to the new name, and SQL is updated as well. | |
Workbench | You can now bulk remove unused data sources from the Flow Area via the context menu: 'Remove unused data sources'. |
The former button 'Delete' is now named 'Removed' in the Flow Area menu to make clear that removed data sources are not deleted and can be added to the Project again later. | |
SQL Editor | Users are now warned if they leave the SQL Editor without confirming or updating the query. |
When generating production SQL queries, table-reference CTEs are not used; the tables are referenced directly instead. Intermediate nodes are still generated as CTEs. | |
Deployment | The user's timezone is now displayed next to the time of the next run when scheduling a deployment. |
You can now copy the SQL statement via a 'Copy to Clipboard' button in the Deployment History. | |
The 'Assign new Destination' area is now enlarged to provide more scrollable deployment destinations. | |
UI and Wording Improvements | We made some design and wording improvements for consistent dialog content as well as for help texts. |
The Home Page and Search Results page now come with dark-themed entity cards. | |
The visual design of the Flow toggle button to show or hide unconnected nodes is now more prominent to inform you about the current view state. |
July 2022#
Topic | What Changed? |
---|---|
Workbench | The whole 'Exchange Source' workflow comes with an improved UI: The current source is listed and all available new source options can be selected from a drop-down or via a click in the Flow Area. A warning banner informs you if the new source would not match the schema. |
Several UI improvements enhance the Workbench tabs: the 'Profile' tab now contains the total row and column information, and the 'Explore' tab has improved wording and interaction buttons. | |
Administration | The 'Snowflake Settings' page has been reorganized. The 'Advanced Snowflake Properties' as well as the option to reconnect to Snowflake can now be found in the kebab menu in the upper right-hand corner. The current Snowflake user login credentials are now displayed and can be changed via the 'Edit' button, which appears only on hover. Schemas can now be updated via the 'Refresh Schemas' button at the top of the schema section. |
Formula Builder | Each new column that contains a user formula now has a leading formula icon in the column header to indicate applied formulas. |
Data Grid | You can now manually refresh the column metrics for each column to ensure the latest results. |
The design and order of all tabs in the Data Grid have been improved. The new order is: Columns, Preview, Profile, SQL and Explore. The Data Grid area can also be minimized or maximized. | |
Transformation | You can now edit already applied transformations from the context menu of a node or double-click a node to edit the transformation. Transformations from the 'Recipe' can be edited via the 'Edit' icon within the recipe. |
SQL Editor | The 'Aggregation' functions, e.g. AVG, MEDIAN, COUNT, ... have been added to the SQL Editor. |
SQL nodes can now be renamed. When renaming the nodes, a warning will be shown. | |
Deployment | Both the 'Deployment History' and the 'Execution History' are limited to 100 entries and a length of max. 1,000 characters to avoid long loading times. |
June 2022#
Topic | What Changed? |
---|---|
Workbench | The 'Profile' tab in the Data Grid now presents all information and column metrics. |
You can now exchange the data source in a Project even though a full pipeline has already been created. | |
The 'Show all Nodes' button in the Flow Area is now more prominent when entering the 'Show only connected nodes' mode. | |
Schema changes are now propagated downstream and no longer break the pipeline, e.g. when a column is renamed in a node that is used in a further transformation. | |
Unused data sources can now be bulk-removed from the Flow Area. | |
When navigating between nodes in the Flow Area, the selected 'Preview' tab in the Data Grid is remembered and active when switching back to the node. | |
Transformation | New aggregate functions are available: Median, Mode, Stddev/Stddev_Samp, Stddev_Pop, Var_Pop, Var_Samp, Approx_Count_Distinct, Kurtosis, Skew. |
Navigation | Closing a schema details page returns you to the page from which you navigated to the schema details page. |
Metadata | When changing the ownership of a node, a snack bar now confirms this ownership change. |
Deployment | When scheduling, the user's time zone is displayed next to the 'Next Run' entry. |
The SQL statement is now available in the Deployment history and can be copied. | |
The 'Assign new Destination' section in the 'Deploy to Snowflake' dialog is now enlarged for better usability. | |
You can now investigate the Deployment and Execution History for each deployed node in the Inspector. | |
UI Improvements | We made some design and wording improvements for help texts and previews. |
May 2022#
Topic | What Changed? |
---|---|
Homepage | The Datameer product Homepage now comes with a dark theme. Asset cards are now dark-colored to improve readability. |
Project | A Project can now be cloned by an admin or a Project collaborator from the Datameer Homepage as well as from the Project details page. The data sources and all transformations are cloned. Published assets, the scheduling configuration, Project descriptions, tags and properties, collaborators, and the created and modified dates of the original Project are not cloned. The cloned Project is owned by the user who cloned the Project. |
Flow Area | The naming convention for Datameer views in the Flow Area has changed. New nodes are now named based on the applied transformation, e.g. when filtering data, the new view is called 'FILTER'. If the Project has multiple operations of the same transformation type (e.g. filter), the nodes are named sequentially, e.g. FILTER, FILTER 2, ... |
The banner below the Flow Area now shows an adapted message for the data preview and sampling functionality. | |
Data Browser | An updated message is now shown in case the Data Browser shows no entries because no datasets were added. |
Functions | JOIN recommendations can now be triggered manually and are not presented by default. |
SQL | You can now view and copy the formatted SQL for each transformation node, including all upstream SQL transformations. To copy the SQL query, select 'Copy to Clipboard' in the 'SQL' tab of the Data Grid. |
SQL comments are now presented in a lighter font color for better readability. | |
Details Page | If no data can be loaded due to an immediate error or an error on an upstream dataset, the error on the details page now shows up directly below the dataset name. |
General UI Improvements | We made some design and wording improvements for error messages, the dropdown chevron and hints. |
April 2022#
Topic | What Changed? |
---|---|
Projects | Errors are no longer shown when a Project name is entered with leading or trailing whitespace. |
Workbench | The Inspector now shows the 'Data Origin' information for a published asset. |
Data Browser | Bulk loading datasets from a schema into the Project now requires confirmation, which prevents loading a large number of datasets by mistake. |
Transformations | A user now sees a note that updating a view is prohibited when editing any operation would break the updated view. |
SQL Editor | You can now build your SQL queries quickly based on the auto-complete functionality. |
The SQL Editor now provides a list of all available SQL functions to help you write SQL queries quickly and with the correct syntax. | |
An improved error message informs the user of duplicated column names more precisely in the SQL Editor. | |
SQL errors now show the correct position in the query when editing downstream SQL operations. | |
Uploading CSV | Column headers are now recognized and specified during a CSV upload. |
Finding Data | A user can now search for Projects, datasets, schemas and source locations on the Datameer Home page without listing intermediate views. |
Publishing | During the 'Publish' workflow, a visual indication now shows the schemas to which the user may publish a view or table, depending on permissions. |
Tables are now shown with the 'table' icon and views with the 'view' icon in both the Data Browser and the Flow Area. Furthermore, the Inspector now shows whether a published asset is a view or a table. | |
An improved error message now warns you when republishing a view to a suspended Snowflake warehouse. | |
General UI Improvements | We made some design and wording improvements for dialog boxes, buttons and hints for first-time users. |
Performance | When a query leads to a non-scalable response because the response is too large, the user is now informed via an error message. |
March 2022#
Topic | What Changed? |
---|---|
Sign Up | When signing up for the free trial, you now get more information about the sign-up progress, e.g. how long it may take for the instance to spin up. |
A default warehouse must now be defined for the Datameer service account during the initial setup. The warehouse can be changed in the 'Snowflake Settings'. When a warehouse is selected from the drop-down, it is validated: an error is shown if the warehouse is suspended, and a warning is shown if the selected warehouse size is the smallest. | |
The Datameer trial terms and conditions have been updated. | |
Data Browser | Uploading a CSV can be done in the Data Browser via the 'Upload CSV' button. More delimiters for the CSV upload are now supported, e.g. pipe, semicolon, ... |
Workbench | Source or published datasets that refer to Snowflake cannot be renamed. Renaming is allowed only for transformed datasets within Datameer. |
UI | Tables and Views are now associated with the matching icons in the Workbench UI. |
SQL Editor | Complex columns (ARRAY, OBJECT, VARIANT, GEOGRAPHY) can now be viewed, imported and used in Datameer. They can also be used in the SQL Editor. |
The SQL Editor now provides basic syntax highlighting (keywords, functions, numbers, string literals, identifiers) with a colorblind-safe palette. | |
Improved contextual auto-complete helps users write SQL queries more easily in the SQL Editor. | |
Common Table Expressions are now supported. Views with CTEs can now be easily combined with operation stacks and other SQL views. | |
Publishing | Cron expressions are available again for scheduling publishing tables to Snowflake. |
When investigating your publish history, the list is now displayed in descending order. The newest entry is listed first. | |
Troubleshooting | When the connection to Snowflake is lost, both a yellow frame as a visual indicator and a warning message with a hint on how to solve the problem are displayed. |
Performance | An optimized execution infrastructure now allows operations to be executed without being queued.