Introduction
Datameer is a complete end-to-end solution for data analytics. Datameer is built on top of Hadoop, an open source technology that supports the storage and processing of large data sets in a distributed computing environment. Datameer tools simplify extraction, transformation and loading and real-time data retrieval. For business users, Datameer offers a familiar spreadsheet interface and intuitive data visualization.
You need to know a Unix-based system, (such as Ubuntu Linux 9.10) and know details about accessing where your data is stored such as IP addresses, local host information, user names, passwords, and so on.
See System Requirements for details on the hardware and software requirements and the required information to access your data using Hadoop.
This guide describes how to use Datameer and covers the following:
- How to configure data stores and link to data stores
- How to set up import jobs
- How to view jobs
- How to configure the mail server
- How to configure the Hadoop cluster
- How to configure database drivers
- How to export data
- How to set up and manage users and groups
- How configure Datameer to use Jetty with SSL enabled
To learn about how Datameer works, see the Concepts Guide and see Datameer Video Tutorials.
To learn about extending Datameer functionality, see the Developer Guide.
To learn more about Hadoop, see Hadoop and Datameer.
Overview
Datameer provides a wizard-based integration to all data - structured, semi-structured and unstructured. Integration is fast and easy because there is no need for explicit data and schema mappings.
Let's get started. As a system administrator, you will be spending most of your time using the features located by clicking the Data tab or Administration tab at the top of the page.
Click the Administration tab to view the system dashboard--a quick at-a-glance look at all running systems. You can also set up authentication, groups and users, jobs, and configure the mail server, Hadoop Cluster settings, and configure the database drivers and plug-ins used to connect to your data. This is where you will initially configure the Datameer system.
Click the Data tab to access where you will set up import jobs, export jobs, and configure data stores. Once you have the system up and running, this is where you will spend more time.
Viewing the System Dashboard
To view the system dashboard, click the Administration tab on the top of the page. The System Dashboard tab is selected by default.
The list of current running jobs displays. You can view job details and run specific jobs.
Viewing Jobs
To view jobs, click the Administration tab on the top of the page, then click the Jobs History tab on the left. From the summary page, you can view the list of jobs, when they last started, finished, the run time, and the run status. Position the cursor over the run status icon for a description. As your use of Datameer grows, use the controls below the table to view additional pages of jobs.
From here you can view job details including the job status, job history, and job log file. You can easily see counts of how many records are processes, and how many records are dropped (if any).

View Job Details
You can view the job history, view the log file, or download the log file. You can also run a job or view the source details.
To view job history, view the log file, or download the log file
- Click the Administration tab on the top of the page, then click the Jobs tab on the left.
- Click the radio button next to name of the job and click the Details button.
- The Counter History shows how many records are processed and how many are dropped.
- You can specify how many lines of the job logfile display. Use the scroll bar to view the log. (See Logfile details.)
- You can also click the Download Logfile link to download the log file.
- Use the play or pause controls to run or pause the job.
Logfile details

To run a job:
- Click the Administration tab on the top of the page, then click the Jobs History tab on the left.
- Click the name of the job and click the Run button.
The job is queued, runs, and then displays the results.
Depending on the volume of data, a job may take some time to run.
To view source details:
- Click the Administration tab on the top of the page, then click the Jobs History tab on the left.
- Click the name of the job, and click the Workbook Details button.
- From here, you can edit the workbook, by clicking the Open button. See Working with Workbooks to learn more.
- To view the complete data set, click the View Latest Results button. You can click the tabs to view each sheet in the workbook. You can scroll through the data, or click the Go To Line button to view a specific record. From here, you can click the Open button to edit the workbook.
- From the Workbook Details page, click the Details link to view details about the job.
To Show Me How, see Using Go To Line.
