Friday, July 28, 2017

Week 10

Week 10





The plan for this week was to prepare slides for "The Download" tech talk, unify categorical and continuous trees into the original push and parallelize classification. The plan was also to fetch data from Scopus.

Day 1:
  • Work on slides for the Tech Talk.
Day 2:
  • Practice Talk for the tech talk
  • Combine Categorical and Continuous Trees with original Gradient Boosting Trees framework
Day 3 and 4:
  • Scrapper to fetch data from Scopus.
  • Reading on projections and clustering vectors.
Day 5:
  • Few optimizations to code and fix code-style violations.
  • Finish up few touch ups for the slides for the tech talk.

Friday, July 21, 2017

Week 9

Week 9





The plan for this week was to test continuous and categorical data for gradient boosting and integrate regression and classification tree on larger datasets.

Day 1:
  • Did a study on the dataset provided by Roger Dev. It had 5000 records, 52 attributes, and 7 classes.
  • Developed script to read this data into ECL and verify the algorithm.
Day 2:
  • On a parallel note spent this day studying Scopus data as part of my parallel project in connecting legislative and research documents.
Day 3:
  • The dataset took a large time to run.
  • Spent the rest of the day debugging the problem.
  • Fixed the issue.
Day 4: 
  • Wrote a naive scrapper to parse data from Scopus.
  • Need to fetch large data. Planning over the weekend.
Day 5:
  • Started working on slides for Tech Talk on Aug 1st

Friday, July 14, 2017

Week 8

Week 8





The plan for this week was to develop a decision tree to handle both continuous and categorical data for gradient boosting and integrate regression and classification tree.

Day 1:

  •  Addressed feedbacks from my mentor on commits for Regression and Decision Trees

Day 2:
  • Developed the stub methods and creating generic modules for future implementations of mixed decision tree regression.
Day 3:
  • Combining Splitting techniques for regression and classifications
Day 4: 
  • Implement mixed trees
  • Plugin mixed trees to gradient boosting
  • Test for Gradient Boosting using mixed trees for classification and regressions
Day 5:
  • Field Type generator to easily use default field types
  • Community Service to Food bank for 3/4th of the day.

Friday, July 7, 2017

Week 7

Week 7



A relatively short week due to the 4th of July break.

Day 1
  •  Implement tests for all the algorithms implemented over last six weeks
Day 2: 
  • Fix few leftover tests
  • Send code for review to Dr Holt
Day 5:
  • Fix recommended changes and sent code back again.