Department
Information Technology

Data science & Analytics lab focuses on methods, processes, algorithms and systems to extract knowledge or insights from data in various forms, either structured or unstructured.  It also aims at giving exposure to students regarding  analysing large-scale text streams such as news, blogs, and social media to identify trends by using Open Source Frameworks available. Various Open Source databases like MySQL, PostgreSQL, Oracle, MongoDB, SQLite, CouchDB, Data Analysis Frameworks   like Pentaho, WEKA, RapidMiner are available for students to work on.

 


Lab Objectives: Students can Utilize this laboratory to


1.Construct problem definition statements for real life applications and implement a database for the same.


2. Design conceptual models of a database using ER modelling by using Opensource tools like Dia, Umbrello for real life applications and also construct queries in Relational Algebra.


3. Create and populate a RDBMS, using MySQL, PostgreSQL as well as Analyse and apply concepts of normalization to design an optimal database.


4.Implement the appropriate data mining methods like classification, clustering or association mining on large data sets using open source tools like WEKA, Pentaho, RapidMiner.


5.Demonstrate capability to use Big Data Frameworks like Hadoop &  Program applications using tools like Hive, pig, , NO SQL and MongoDB, CouchDB for Big data Applications


 6. Design and implement algorithms to analyze Big data like streams, Web Graphs and Social Media data and construct recommendation systems.

Unique Features: Use & Accessiblity of Remote Servers, E-learning with Moodle, Virtual Classroom & Web Conferencing facility, Online  Lab Assignment Submissions, Online Assesment  & grading with feedback, Cloud Storage, Online Tests for self-assessment, Authentication based Internet & Printing Facility, Centralized Power Backup.

405b405a