Note
Click here to download the full example code
12.3.4.3. CategoricalsΒΆ
import pandas as pd
dataFrame = pd.DataFrame({"id": [1, 2, 3, 4, 5, 6], "raw_grade": ["a", "b", "b", "a", "a", "e"]})
dataFrame["grade"] = dataFrame["raw_grade"].astype("category")
dataFrame["grade"].cat.categories = ["very good", "good", "very bad"]
dataFrame["grade"] = dataFrame["grade"].cat.set_categories(["very bad", "bad", "medium", "good", "very good"])
dataFrame["grade"]
Out:
0 very good
1 good
2 good
3 very good
4 very good
5 very bad
Name: grade, dtype: category
Categories (5, object): ['very bad', 'bad', 'medium', 'good', 'very good']
dataFrame.sort_values(by="grade")
dataFrame.groupby("grade").size()
Out:
grade
very bad 1
bad 0
medium 0
good 2
very good 3
dtype: int64
Total running time of the script: ( 0 minutes 0.008 seconds)