Version 0.24.0

NumPy’s universal function (ufunc) compatibility

We added the compatibility of NumPy ufunc (#1127). Virtually all ufunc compatibilities in Koalas DataFrame were implemented. See the example below:

>>> import databricks.koalas as ks
>>> import numpy as np
>>> kdf = ks.range(10)
>>> np.log(kdf)
         id
0       NaN
1  0.000000
2  0.693147
3  1.098612
4  1.386294
5  1.609438
6  1.791759
7  1.945910
8  2.079442
9  2.197225

Other new features and improvements

We added the following new features:

koalas:

koalas.DataFrame:

koalas.Index

koalas.MultiIndex:

koalas.SeriesGroupBy

koalas.DataFrameGroupBy

Other improvements

  • Setting index name / names for Series (#1079)

  • disable ‘str’ for ‘SeriesGroupBy’, disable ‘DataFrame’ for ‘GroupBy’ (#1097)

  • Support ‘compute.ops_on_diff_frames’ for NumPy ufunc compay in Series (#1128)

  • Support arithmetic and comparison APIs on same DataFrames (#1129)

  • Fix rename() for Index to support MultiIndex also (#1125)

  • Set the upper-bound for pandas. (#1137)

  • Fix _cum() for Series to work properly (#1113)

  • Fix value_counts() to work properly when dropna is True (#1116, #1142)