2.6 Conclusion

This chapter introduced the Absenteeism at Work dataset, a real-world collection of absence records spanning three years at a Brazilian courier company. You now know what each variable measures, what type of data it holds, and which business questions the data can help answer.

In the next chapter, we will set up the tools you need to start working with this data: the R programming language and the RStudio development environment. By the end of Chapter 3, you will be ready to load, inspect, and begin exploring the dataset yourself.