You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
%%time
rawdat = np.random.rand(5400000,768)
dat = pd.DataFrame(rawdat,columns=[str(i) for i in range(rawdat.shape[1])])
ums = umap.UMAP(metric='cosine', #def euclidean
min_dist=0.1, # def 0.1
n_neighbors=15, # def 15
n_components=2, #def 2
transform_seed=1,
verbose=True
).fit(dat)
the python kernel crashes after ~4 minutes, with output:
UMAP(angular_rp_forest=True, metric='cosine', transform_seed=1, verbose=True)
Thu Oct 10 13:51:34 2024 Construct fuzzy simplicial set
Thu Oct 10 13:51:37 2024 Finding Nearest Neighbors
Thu Oct 10 13:51:39 2024 Building RP forest with 64 trees
never reaching "NN descent".
Memory should not be a problem. My machine (macbook pro) has 128GB of RAM. Memory usage as reported by htop reaches ~30GB just before the kernel crash.
The text was updated successfully, but these errors were encountered:
After usual imports
I have
python version: 3.9.18
umap version: 0.5.6
My dataset has 5,400,000 points in a 768 dim space. For 540,000 points, umap seems to work fine:
yields
But for 5,400,000 data points,
the python kernel crashes after ~4 minutes, with output:
never reaching "NN descent".
Memory should not be a problem. My machine (macbook pro) has 128GB of RAM. Memory usage as reported by htop reaches ~30GB just before the kernel crash.
The text was updated successfully, but these errors were encountered: