We added compatibility with NumPy ufuncs (#1096, #1106). Virtually all ufuncs can now be applied to a Koalas Series. See the example below:
>>> import databricks.koalas as ks
>>> import numpy as np
>>> kdf = ks.range(10)
>>> kser = np.sqrt(kdf.id)
>>> type(kser)
<class 'databricks.koalas.series.Series'>
>>> kser
0    0.000000
1    1.000000
2    1.414214
3    1.732051
4    2.000000
5    2.236068
6    2.449490
7    2.645751
8    2.828427
9    3.000000
We added the following new features:
koalas:
option_context (#1077)
koalas.DataFrame:
where (#1018)
mask (#1018)
iterrows (#1070)
koalas.Series:
pop (#866)
first_valid_index (#1092)
pct_change (#1071)
koalas.Index:
symmetric_difference (#953, #1059)
to_numpy (#1058)
transpose (#1056)
T (#1056)
dropna (#938)
shape (#1085)
value_counts (#949)
koalas.MultiIndex:
Fix comparison operators to treat NULL as False (#1029)
Make corr return koalas.DataFrame (#1069)
Include link to Help Thirsty Koalas Fund (#1082)
Add Null handling for different frames (#1083)
Allow Series.__getitem__ to take boolean Series (#1075)
Produce correct output against MultiIndex when ‘compute.ops_on_diff_frames’ is enabled (#1089)
Fix idxmax() / idxmin() for Series to work properly (#1078)
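Several of the entries above (where, mask, pct_change, boolean-Series indexing, idxmax / idxmin) are named after their pandas counterparts. The sketch below uses plain pandas as a stand-in to illustrate the semantics those names carry in pandas. It is an assumption, not something these notes confirm, that the Koalas versions return matching results for the same input. The sample values and variable names are made up for illustration.

```python
import pandas as pd

# pandas stand-ins for a Koalas Series / DataFrame (illustrative data only).
s = pd.Series([10.0, 20.0, 15.0, 30.0])
df = pd.DataFrame({"a": [10.0, 20.0, 15.0, 30.0]})

# Indexing with a boolean Series keeps only the rows where the mask is True.
selected = s[s > 12]

# where keeps values that satisfy the condition and puts NaN elsewhere;
# mask is the inverse: it hides values that satisfy the condition.
kept = df.where(df > 12)
hidden = df.mask(df > 12)

# pct_change gives the fractional change relative to the previous row
# (the first row has no predecessor, so it is NaN).
change = s.pct_change()

# idxmax / idxmin return the index label of the largest / smallest value.
print(selected.tolist())
print(change.tolist())
print(s.idxmax(), s.idxmin())
```

With Koalas installed, the same calls could be tried on `ks.Series` and `ks.DataFrame` objects to compare against the pandas behaviour shown here.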