2018 Notes

PyData NYC Oct 2018

pandas: Integer NA as a first class citizen
Jeff Reback (Two Sigma)

  https://www.youtube.com/watch?v=gxvTVxlvH9w
  https://github.com/jreback/pydata_nyc-2018

  • Extension Arrays - Specification of how external arrays interface with pandas
    • First class data dtypes
    • Decouple memory management and processing from pandas (and numpy)
  • 0.24 - First class dtypes
    • Integer NA
      • dtype='Int64' (capitalized!)
      • Uses IntegerArray instead of numpy array
      • Has a values array and a boolean mask (True marks a missing entry)
    s.values        # IntegerArray([1, 2, nan], dtype='Int64')
    s.values._data  # array([1, 2, 1])  <- masked slot holds a placeholder value
    s.values._mask  # array([False, False, True])
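
  A minimal sketch of the difference the capitalized dtype makes, assuming a
  pandas version with nullable integers (0.24 or later). The variable names
  (floats, ints, bumped) are illustrative and not from the talk. Only public
  API is used, since _data and _mask are private attributes.

```python
import pandas as pd

# Default inference: a missing value forces the integers to float64.
floats = pd.Series([1, 2, None])

# Nullable integer dtype (note the capital "I"): values stay integers.
ints = pd.Series([1, 2, None], dtype="Int64")

print(floats.dtype)          # float64
print(ints.dtype)            # Int64
print(ints.isna().tolist())  # [False, False, True]
print(ints.sum())            # 3 (missing values are skipped by default)

# Arithmetic keeps the dtype and propagates the missing entry.
bumped = ints + 1
print(bumped.dtype)          # Int64
```

  The public isna() call reports the same information as the mask shown
  above, without reaching into the array's internals.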

Measuring Model Fairness
J. Henry Hinnefeld (Civis Analytics)

  https://www.youtube.com/watch?v=V3tmxMf2UH8



