

Welcome to BigDataX
Join us as we embark on a mission to create a sharing community
to build up practical data processing literacy in Singapore
and around the region!
Upcoming
Happy holidays!
See you in Jan 2020!
Archives (Videos & resources)
Check out the videos and tutorials from our past BigDataX meetup workshops (in chronological order)
Introduction to Hadoop File System as storage for Big Data
Presented by Nick Choy (28 Jun 2018)
Pre-Work: Cloudera Hadoop Installation
Pre-Work: Setup Cloud Dataproc Cluster
Pre-Work: Load MySQL Sample Employee Database
Slides: Slide Deck
Video: Part (1/2)
Video: Part (2/2)
Demo Activity 1: Ingestion
Demo Activity 2: Ingesting RDBMS data into HDFS
Spark 101 (Theory and Hands-On)
Presented by Divya Gehlot (26 Jul & 29 Aug 2018)
Pre-Work: Setup Guide
Video: Spark 101 (Theory)
Slides: Slide Deck
Python Spark Tutorial
Presented by Natalino Busa (13 Sep 2018)
Pre-Work: Setup Kubernetes cluster
Video
Apache Drill Workshop
Presented by Divya Gehlot (24 Oct 2018)
Pre-Work: Apache Drill Installation
Activity: Workshop Guide
Slides: Slide Deck
Datasets: Workshop Datasets
GraphAI - Machine Learning on Network Data
Presented by Gabor Benedek (29 Nov 2018)
Video
Meet the organizers
Divya Gehlot
I am a pioneer Hadoop user and a passionate learner of big data technologies. I have experience working with data ranging from structured financial data to unstructured telco data, and have worked on almost all the big data technologies. I am a firm believer that data is the most valuable asset and that the right data has the power to move the world.
Kenneth Leung
I am a licensed drug dealer (pharmacist) and data analyst in a Singapore public hospital. Am also an avid learner and doer, and always keen to upgrade and upskill in order to transform pharmacy (and healthcare) for the better. Outside of work, I enjoy reading, calisthenics, and whisky neat.
Vicky Miao
I’m an IT Business Analyst and CPA holder with a double degree in Accounting and Statistics. Terminology comes and goes, but the constant is a data explosion and the need to make sense of it, especially for data-intensive industries like financial services and IT. BigDataX is a group of data enthusiasts who are eager to learn and share the latest big data techniques. Let’s grow together and have fun along the way!
Paul Lorett Amazona
I am a developer who has spent several years in the investment banking domain delivering end-to-end .NET solutions. I am passionate about using data science, technology, and AI to solve problems. The world is full of exciting problems, and Big Data tech is a great tool to have in our arsenal.

Rachel Yen
I am a Pharma researcher turned Business Analyst. I am passionate about AI/ML (Artificial Intelligence and Machine Learning) in Healthcare.

Tan Rui Qing
I am your average process-oriented strategist with a background in IT and Museum Education. I believe in 'Humans first, Technology second'.