We added the compatibility of NumPy ufunc (#1127). Virtually all ufunc compatibilities in Koalas DataFrame were implemented. See the example below:
>>> import databricks.koalas as ks >>> import numpy as np >>> kdf = ks.range(10) >>> np.log(kdf) id 0 NaN 1 0.000000 2 0.693147 3 1.098612 4 1.386294 5 1.609438 6 1.791759 7 1.945910 8 2.079442 9 2.197225
We added the following new features:
koalas:
to_numeric (#1060)
to_numeric
koalas.DataFrame:
idxmax (#1054)
idxmax
idxmin (#1054)
idxmin
pct_change (#1051)
pct_change
info (#1124)
info
koalas.Index
fillna (#1102)
fillna
min (#1114)
min
max (#1114)
max
drop_duplicates (#1121)
drop_duplicates
nunique (#1132)
nunique
sort_values (#1120)
sort_values
koalas.MultiIndex:
levshape (#1086)
levshape
koalas.SeriesGroupBy
head (#1050)
head
koalas.DataFrameGroupBy
Setting index name / names for Series (#1079)
disable ‘str’ for ‘SeriesGroupBy’, disable ‘DataFrame’ for ‘GroupBy’ (#1097)
Support ‘compute.ops_on_diff_frames’ for NumPy ufunc compay in Series (#1128)
Support arithmetic and comparison APIs on same DataFrames (#1129)
Fix rename() for Index to support MultiIndex also (#1125)
Set the upper-bound for pandas. (#1137)
Fix _cum() for Series to work properly (#1113)
Fix value_counts() to work properly when dropna is True (#1116, #1142)