  1. Converting a DataFrame from pandas to dask - Stack Overflow

    Oct 22, 2020 · I followed the documentation for dask.dataframe.from_pandas, which has optional arguments called npartitions and chunksize. So I tried to write something like this: import …

  2. Dask DataFrame.to_parquet fails on read - Stack Overflow

    Mar 15, 2022 · Use dask.dataframe.read_parquet or other dask I/O implementations, not dask.delayed wrapping pandas I/O operations, whenever possible. Giving dask direct access …

  3. How to transform Dask.DataFrame to pd.DataFrame?

    Aug 18, 2016 · How can I transform my resulting dask.DataFrame into pandas.DataFrame (let's say I am done with heavy lifting, and just want to apply sklearn to my aggregate result)?

  4. Strategy for partitioning dask dataframes efficiently

    Jun 20, 2017 · The documentation for Dask talks about repartitioning to reduce overhead here. However, it seems to indicate that you need some knowledge of what your dataframe will look …

  5. python - Why does Dask perform so much slower while multiprocessing …

    Sep 6, 2019 · 36 dask delayed 10.288054704666138s. My CPU has 6 physical cores. Question: Why does Dask perform so much slower while multiprocessing performs so much faster? Am I using …

  6. dask: difference between client.persist and client.compute

    Jan 23, 2017 · More pragmatically, I recommend using persist when your result is large and needs to be spread among many computers and using compute when your result is small and …

  7. dask - Make Pandas DataFrame apply() use all cores? - Stack …

    Mar 19, 2019 · As of August 2017, Pandas DataFrame.apply() is unfortunately still limited to working with a single core, meaning that a multi-core machine will waste the majority of its …

  8. Reading an SQL query into a Dask DataFrame - Stack Overflow

    May 24, 2022 · I'm trying to create a function that takes an SQL SELECT query as a parameter and uses dask to read its results into a dask DataFrame using the dask.read_sql_query function.

  9. Using Dask's NEW to_sql for improved efficiency (memory/speed) …

    Jun 16, 2020 · I tried .compute() to transform the dask dataframe to a pandas dataframe and then write to_sql, but that kind of defeated the purpose of using dask to read/data model and again …

  10. Comparison between Modin | Dask | Data.table - Stack Overflow

    May 27, 2021 · dask was the first, has a large ecosystem, and looks really well documented, discussed in forums and demonstrated in videos. modin (ray) has some design choices which …