--- title: "Python Dataclasses: Derived Fields and Validation" date: 2024-01-15T11:02:21-05:00 draft: false tags: - Python math: false medium_enabled: false --- Python dataclasses provide a simplified way of creating simple classes that hold data. ```python from dataclasses import dataclass @dataclass class Person: name: str birth_year: int ``` The above code is equivalent to: ```python class A: def __init__(name: str, birth_year: int): self.name = name self.birth_year = birth_year self.__post__init__() ``` Notice the call to `__post__init__` at the end. We can override that method to do whatever we'd like. I have found two great use cases for this. ## Use Case 1: Derived Fields Straight from the [Python documentation](https://docs.python.org/3/library/dataclasses.html#dataclasses.__post_init__), this use case is for when we want to use some variables to create a new variable. For example, to compute a new field `age` from a person's `birth_year`: ```python class Person: name: str birth_year: int age: int = field(init=False) def __post_init__(self): # Assuming the current year is 2024 and their birthday already passed self.age = 2024 - self.birth_year ``` ## Use Case 2: Validation Another use case is to make sure that the user instantiates the fields of a data class in a way we expect. ```python class Person: name: str birth_year: int def __post__init__(self): assert self.birth_year > 0 assert isinstance(self.name, str) ``` Nothing is stopping us from combining both of these use cases within the `__post_init__` method!