Hi, thanks to all of you who spend time on this project, it is invaluable to me.
Unfortuatnely, i think I have run into an indexing-type issue, either that or my expectations are incorrect. Consider the following script:
import random import pandas import numpy as np columns = list('abcdefg') subcols = columns[1:-1] X = pandas.DataFrame(0.0, columns=columns, index=range(100)) Z = pandas.DataFrame(np.random.randn(100,len(subcols)), columns=subcols, index=range(100)) block1 = list(subcols) random.shuffle(block1) X[block1] -= Z print X.corrwith(Z) X = pandas.DataFrame(0.0, columns=columns, index=range(100)) X[block1] -= Z[block1] print X.corrwith(Z) print pandas.__version__
The output i get is something like this (obviously depends on the random inputs)
a NaN
b -0.127386
c -0.073521
d -0.073521
e -0.127386
f -1.000000
g NaN
dtype: float64
a NaN
b -1
c -1
d -1
e -1
f -1
g NaN
dtype: float64
0.15.0rc1-10-g215569a
where the second result is what i would have expected.
I was surprised to find that I have to "reindex" Z by block1 to get a correct result. Is my expectation incorrect?
Many thanks
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4