Trying to get started following the 0.2 tutorials. This references the Table Join exercise using the movie example data to answer the question “What genres are rated most differently by males and females?”
My code is below. Is there a more efficient solution to this problem? Is there a way to avoid creating the two new tables?
hail seems to be notifying me that it is sorting the data multiple times. Why? Is there a way to avoid this?
What prerequisite knowledge in distributed computing should I have? Can anyone suggest additional resources for learning patterns for more complex queries?