Project Introduction

Tiger's Candy, a candy store that originated on the RIT campus, has rapidly gained popularity. To handle their growth, they have decided to build an automated data system for processing online orders in batches.

As their new database engineer intern, you will work with a data system blueprint designed by Tom, a part-time engineer. Your role is to continue developing this system and load data according to specifications.

In this project, you will implement batch processing logic that processes raw order transactions at the end of each day. This process includes validating transaction details and verifying inventory levels so that orders can ship successfully. You will then feed the daily sales and profit figures into a time series forecasting model to predict future sales and profits.
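The inventory check described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the function names `can_ship` and `process_batch`, the `inventory` dictionary (product ID mapped to units on hand), and the rule that an order ships only if every item is fully in stock are all hypothetical, not part of Tom's blueprint. Follow the project requirements for the actual rules.

```python
def can_ship(order_items, inventory):
    """Return True if stock covers every product in the order.

    order_items: list of {"product_id": int, "qty": int}, qty already validated
    inventory:   dict of product_id -> units on hand (assumed structure)
    """
    # Sum quantities per product in case a product appears more than once.
    needed = {}
    for item in order_items:
        pid = item["product_id"]
        needed[pid] = needed.get(pid, 0) + item["qty"]
    return all(inventory.get(pid, 0) >= qty for pid, qty in needed.items())


def process_batch(orders, inventory):
    """Process orders in sequence, reducing stock for each order that ships."""
    shipped, rejected = [], []
    for order in orders:
        if can_ship(order["items"], inventory):
            for item in order["items"]:
                pid = item["product_id"]
                inventory[pid] = inventory.get(pid, 0) - item["qty"]
            shipped.append(order["transaction_id"])
        else:
            rejected.append(order["transaction_id"])
    return shipped, rejected


# Toy data, invented for this sketch only.
inventory = {17: 6, 3: 2}
orders = [
    {"transaction_id": 1, "items": [{"product_id": 17, "qty": 5}]},
    {"transaction_id": 2, "items": [{"product_id": 17, "qty": 2}]},
    {"transaction_id": 3, "items": [{"product_id": 3, "qty": 2}]},
]
shipped, rejected = process_batch(orders, inventory)
print(shipped, rejected)
```

In this toy run, orders 1 and 3 ship, and order 2 is rejected because only one unit of product 17 remains after order 1.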

Dataset Description

  {
    "transaction_id": 73434473,
    "customer_id": 29,
    "timestamp": "2024-02-02T12:00:40.808092",
    "items": [
      {
        "product_id": 17,
        "product_name": "Sea Salt Crackle Enrobed Bites",
        "qty": 5
      },
      {
        "product_id": 18,
        "product_name": "Almond Shards Enrobed Bites",
        "qty": null
      },
      {
        "product_id": 3,
        "product_name": "Powdered Sugar Sticks Rectangles",
        "qty": 2
      }
    ]
  },
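The sample record above includes an item whose qty is null, so it is worth checking each line item before using it. Below is a minimal sketch, assuming the raw transactions are parsed from JSON into Python dictionaries. The helper name `find_invalid_items` and the rule that a valid qty is a positive integer are assumptions made for illustration; check the requirements for the actual validation rules and for what to do with invalid items.

```python
import json

# One record shaped like the sample above.
RAW = """
{
  "transaction_id": 73434473,
  "customer_id": 29,
  "timestamp": "2024-02-02T12:00:40.808092",
  "items": [
    {"product_id": 17, "product_name": "Sea Salt Crackle Enrobed Bites", "qty": 5},
    {"product_id": 18, "product_name": "Almond Shards Enrobed Bites", "qty": null},
    {"product_id": 3, "product_name": "Powdered Sugar Sticks Rectangles", "qty": 2}
  ]
}
"""


def find_invalid_items(transaction):
    """Return the items whose qty is missing or not a positive integer.

    Assumption: a valid qty is an int greater than zero.
    """
    invalid = []
    for item in transaction.get("items", []):
        qty = item.get("qty")
        # bool is a subclass of int in Python, so exclude it explicitly.
        if isinstance(qty, bool) or not isinstance(qty, int) or qty <= 0:
            invalid.append(item)
    return invalid


transaction = json.loads(RAW)  # JSON null becomes Python None
bad_items = find_invalid_items(transaction)
bad_ids = [item["product_id"] for item in bad_items]
print(bad_ids)
```

For this record, only product 18 is flagged. The sketch reports invalid items rather than dropping them, so the decision about how to handle them stays with the project requirements.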

Requirements and Grading Rubric (100 points)

Tips

  1. Take time to thoroughly review the data and requirements before you start implementing
  2. Start with small, manageable pieces - one file or a section of one file
  3. When results don't match expectations, manually verify those data points
  4. Data accuracy is the most important thing
  5. Learn to solve problems independently
    1. When you get stuck, write down the specific problem you need to solve; stating it precisely often points you toward the solution
    2. Don't hesitate to ask for help with issues you can't resolve
  6. Track your progress and look for ways to work more efficiently
  7. Take a deep breath and stay calm 😉 - this project is meant to challenge you

Important Notes