I am a Data Engineer at 84.51. I enjoy working with all parts of a Data Science stack, primarily ETL, data modeling, reporting building and statistical analysis. I spend most of my time in python, Data Bricks, Snowflake, and Automic
Some things I’ve worked with
Data Engineering Tools
Python
Pyspark to create data frames from parquet and csv files to build data pipelines
Used in API development and API interaction
SQL
Have been using SQL since 2007. Familiar with stored procedures, triggers, use in ETL packages, joins, group bys, CTE etc.
Databricks
Created jobs
Database and Data Warehouse Relevancy
Snowflake
Mostly database & object creation and queries. Familiar with external table structures connected to Azure storage objects
SQL Server
Have used SQL Server since 2007 in various capacities of managing a data warehouse, database administration, backups, Job Agents, user management, etc.
Python
Pyspark to create data frames from parquet and csv files to build data pipelines
Used in API development and API interaction
SQL
Have been using SQL since 2007. Familiar with stored procedures, triggers, use in ETL packages, joins, group bys, etc.
Oracle Goldengate
Used to replicate CRUD operations from production database to SDS
PowerBI
Built dashboards and reports using direct query against a data warehouse
ODI (Oracle Data Integrator)
Build ETL pipelines from Peoplesoft to Student Data Warehouse
OBIEE (Oracle Business Intelligence Enterprise Edition)- Used OBIEE since 2015 to build reports from data models. Very familiar with all OBIEE front end tools including dashboards, filters, column formulas, BI Publisher, etc. Also familiar with backend management of users and security Developed in the three RPD layers
Oracle 11/12 – Some database administration experience with ASM, parameter files, tablespaces, datafiles, user mangement
SSIS
Built ETL pipelines for student data warehouse from different data sources
Tableau
Built Dashboards academically and recreationally
System Administration and Automation
Amazon Web Services
Used to host external database for Senior Project, used for a client to install Canvas and host it.
Azure
Managed resource groups, key vaults as part of hosting an API in the Azure cloud space
Unix/Linux
Currently use Linux to manage edge nodes for ETL pipelines
Used Linux to Manage OBIEE servers and report development source control and Oracle Goldengate, BASH scripting, awk, experience as a Linux Systems Administrator as a primary role from 2007-2014.
Commvault – Installed on a Windows Server and configured agents to backup discs to other discs and also added external media (tapes). Set up retention policies for each media type.
Jenkins – Use Jenkins to automate our Github builds as well as perform unix commands on remote servers
Puppet – Used for a few months at a job. Familiar , but would need a refresher
Vmware – Spun up VMs, migrated them between hypervisors, adjusted memory and other virtual machine parameters
Xenserver – Spun up VMs, migrated them between hypervisors, adjusted memory and other virtual machine parameters
Zabbix – Setup agents to be monitored as well as created triggers and items to be monitored
Higher Education
Campus Solutions
Blackboard Learning Management Systems
CourseEval
Canvas
Data Science Mostly School Related
Rockwell Arena-