NoSQL with Accumulo training in Washington DC
George Mason University, Volgenau School of Engineering

TAIT 0528: NoSQL with Accumulo

Overview

Accumulo is an open source NoSQL database: a sorted, distributed key/value store based on Google's BigTable design. Written in Java, Accumulo adds cell-level access labels and server-side programming mechanisms.

This hands-on course teaches Accumulo’s distinctive “data-centric” security features and how to implement them. Because data today is transformed and reused across different analysis applications, it is advantageous for the database itself to keep track of who is allowed to see each piece of data, rather than repeatedly implementing access rules in every application that uses it.

Audience and Prerequisites

This course is designed for individuals who have prior Java programming experience.

Course Outline Detail

Accumulo Background
A history of NoSQL
A survey of lightly structured stores
Design drivers for Accumulo

Installation and Startup
Environment setup
Basic Accumulo configuration
Running process control scripts
Using Accumulo administrative tools (shell and monitor)

Overview of Accumulo Architecture
Defining the sorted Key/Value space
Range selection and filtering
Table/Tablet organization
Processes and inter-process communication
Control and data flow for read and write operations

API Introduction
Keys, Values, and Mutations
Instances and Connectors
BatchWriter, Scanner, BatchScanner

Application Design
Diagramming table schemas / flexible schemas
Basic indexing theory
Information retrieval design patterns
Joins and pre-joins

Advanced Topics
Hadoop ecosystem integration
Relational operations on Accumulo
Iterators

Advanced API
Iterators
Constraints
Bulk load
ACID/BASE semantics

Cell-level security
Defining domain-specific authorizations
Trust boundaries

Partition Management
Column- and Row-orientation
Row schemas
Locality groups

Information retrieval
Joins
Document-distributed indexes
Partitioned joins with the IntersectingIterator

Statistics
OLAP cube shells
Query-time vs. compaction-time aggregation

Additional Applications
Graph search
Machine learning
Geohashing

Relationship to relational databases
Data definition languages
Pre-joined indexes

Custom iterators
The SortedKeyValueIterator Interface
Filters and Combiners
Lookup Iterators and seeking

Performance optimization
Hot spots and bottlenecks
Managing parallelism
Troubleshooting

Registration

To register, download the registration form and submit it by fax or mail.

Schedule

Please call for our upcoming class schedule.

Tuition

$1,800

CEUs

3.2 CEUs
32 Hours

Onsite Opportunity

Enhance your organization's competitive edge!

George Mason University's TechAdvance Program can tailor programs to meet your organization's needs. Companies or agencies interested in bringing this program on site should contact TechAdvance at 703-993-1551.

Contact Info.

Address:

George Mason University
TechAdvance
Volgenau School of Engineering
3351 Fairfax Drive, Suite 448
Arlington, VA 22201

Telephone: 703-993-1551
Email: advance@gmu.edu