Skip to main content
Filter by
Sorted by
Tagged with
0 votes
1 answer
133 views

I have created a structure, which provided a dataset as opts to a ProcessPoolExecutor and the inputs are the indices for the dataset. I could provide a MWE, but I tried several approaches and all ...
MaKaNu's user avatar
  • 1,076
1 vote
0 answers
58 views

I need to calculate many shortest paths between a set of starting points (origin_nodes) and a set of end points (target_nodes) in a very large graph. The graph can be over 50gb in ram. I could, in ...
MGN's user avatar
  • 505
0 votes
1 answer
77 views

I am trying to code a simulation in SAS with nested RSUBMITs; the nested RSUBMITs are within a macro. I cannot get the macro variables to pass correctly into or out of the 2nd RSUBMIT. Here is a very ...
wellhello's user avatar
0 votes
0 answers
109 views

I am developping a R function than used a function code in Rcpp (i am already new to this). I wanted to make simulations and run the same r functions in a cluster of x cores. Unfortunately when I do ...
Max13's user avatar
  • 37
1 vote
1 answer
85 views

I'm running a simulation using the future and future.apply packages in R where I need to execute multiple iterations of a function in parallel and bind the results together. When I use more than one ...
Linus's user avatar
  • 343
0 votes
0 answers
32 views

I have a compute shader that culls object triangles against frustums. For culling, I use a huge vertex and index buffer and a pair of (offset, count) to identify the range of vertices for a single ...
Raildex's user avatar
  • 5,428
3 votes
1 answer
100 views

I am attempting to parallelise a calculation and consolidate the results into a matrix. A large number of calculations are performed and each one contributes to a summed matrix of all the results. ...
Neil Butcher's user avatar
  • 1,074
-1 votes
2 answers
200 views

To optimize the performance of a C# application and came across AsParallel() in LINQ. I want to understand the key differences between them, especially regarding performance when working with larger ...
Yassine Zakhama's user avatar
0 votes
1 answer
57 views

I'm trying to parallelise with openxlsx and its function writeData the export of many Excel individual files that should be somehow summarised in a central Excel file. As can be shown in the reprex ...
Olivier7121's user avatar
0 votes
1 answer
51 views

I am trying to solve coupled ODEs, but I present a small part of the problem here. I have tried to solve for R along the entire grid. The grid size is too big to run conventional methods, hence I am ...
Rahul Rai's user avatar
0 votes
0 answers
53 views

Below is my sequential reprex Shiny App that I want to run in a parallel environment: lapply(c("shiny", "DT", "parallel"), library, character.only = TRUE) ui <- ...
Olivier7121's user avatar
0 votes
1 answer
49 views

I used to work with usual sync programming and the archtecture implied that if you need anything to run a parallel, you queue it in message system and you would spawn extra process on the same or ...
user3791838's user avatar
0 votes
0 answers
31 views

I have a problem with parallel processing in Apache Camel. I am consuming a batch of 10 messages from Kafka, which gives me an exchange with list of exchanges. I want to process these exchanges in ...
Tom's user avatar
  • 1
0 votes
0 answers
70 views

I am implementing a Docker setup for running Dagster and PySpark together. My docker-compose.yml file looks like this: dagster: container_name: dagster hostname: dagster build: ...
Tavakoli's user avatar
  • 1,433
1 vote
2 answers
178 views

I have a large script where I am processing terabytes of weather/climate data that comes in gridded format. I have a script that uses an outer loop (over years - 1979 to 2024), and for each year, ...
user8229029's user avatar
  • 1,214
0 votes
0 answers
34 views

I have a function that has a numba @njit wrapper around it to make it faster, I've set parallel=True to make it run faster. And now want to measure the time it takes, using time.process_time(), ...
Ninja Jim's user avatar
2 votes
0 answers
74 views

I have a computation with a following structure (pseudocode): intermediate_results = [] for source in sources: # (1) source_data = prepare( load( source ) ) # (2) for sample in ...
meditative potato's user avatar
0 votes
0 answers
32 views

I tried to use arrow::recoredbatchreader to read multiple rowgroups from a parquet file in parallelism. I use GetRecordBatchReader to acquire recordbatchreader. However, I noticed that when the number ...
fcqiao's user avatar
  • 19
0 votes
1 answer
117 views

I would like to partition a list into a sublists, and send a request for each sublist in parallel. I have a list of product ids, want to partition it to be sublists with the size of 3. List<List<...
Yujie Huang's user avatar
-3 votes
1 answer
91 views

I am using the metpy package to calculate many different weather parameters for many different locations across North America for many different hours. I want to fill arrays containing these weather ...
user8229029's user avatar
  • 1,214
1 vote
1 answer
102 views

I want to read a parquet file batch by batch in parallelism. I achieve this by merge multiple continuous rowgroup together and read them by arrow::RecordBatchReader. When I monitor the memory usage ...
fcqiao's user avatar
  • 19
0 votes
1 answer
76 views

I am converting a large data frame into a big.matrix object to enable parallel processing (otherwise, the data frame is too large and I run out of RAM). My code is currently like this: df <- data....
JuM24's user avatar
  • 161
1 vote
2 answers
167 views

I have a procedure which is taking time to execute. The procedure is being called from Oracle APEX upon clicking on a submit button, but it times out after 30 mins. Since the users doesn't want to ...
Dito's user avatar
  • 55
1 vote
0 answers
41 views

I'm trying to load and process large amounts of HDF5 files using Python and convert them into dataframes. The HDF5 files are scattered in equal-sized batches. I've tried two approaches using ...
Dilan's user avatar
  • 11
2 votes
2 answers
312 views

Suppose there are 4 CUDA devices (0,1,2,3) on my computer and there are 10 tasks to run, each tasks is a script named run01.sh, run02.sh, ..., run10.sh. The problem is, each task use only 1 GPU, I ...
link89's user avatar
  • 1,979
1 vote
2 answers
179 views

I developed this function in C to read a file consisting of one word per line, like a standard wordlist. The function has already been optimized to a reasonable extent, but I would like to know if ...
ric's user avatar
  • 101
1 vote
1 answer
45 views

I have a PowerShell script that performs remote execution of a script on multiple servers. The script checks whether a particular script exists on each server, and if it does, it invokes that script ...
anonymousresponse automation's user avatar
0 votes
2 answers
79 views

The question pretty much says it all. This is for benchmarking purposes. I really do need to target specific cores on specific nodes. Targeting particular nodes is not enough in and of itself for ...
bob.sacamento's user avatar
0 votes
1 answer
111 views

I'm facing the problem of reducing values to threads in warps that shares the same variable's content. More specifically, in order to avoid atomic add operation on the an array i'm evaluating ...
ctrlaltdel's user avatar
0 votes
0 answers
31 views

I want to use multiprocessing (Python) to simulate the following scenario: there are multiple computing centers, each with multiple computers, and these computers can execute tasks in parallel. ...
cathy willy's user avatar
2 votes
0 answers
179 views

While using hugging face transformers, when calling the .generate, how are the input prombt internally executed? and how does batch size make a difference? For instance, input dimension (1,128) vs (10,...
AxRy's user avatar
  • 45
0 votes
0 answers
35 views

Apologies in advance if this is a mundane or unclear question. I want to scale up a workflow on on a cluster to run a program concurrently on several nodes. The program in question references a large, ...
gladshire's user avatar
0 votes
0 answers
54 views

We have a very processing heavy flux pipeline. To speed it up we are using parallel flux. But now the problem is that everything is waiting behind Database connection and its still slow. The entire ...
Marko Taht's user avatar
  • 1,558
0 votes
0 answers
35 views

I am wondering if it is possible to run two parallel processes in Jupyter Notebook, in two different cells, so that each process can calculate and print its own results under the cell. So far I have ...
Alex's user avatar
  • 357
2 votes
2 answers
145 views

I'm working on a Python project where I need to process a very large file (e.g., a multi-gigabyte CSV or log file) in parallel to speed up processing. However, I have three specific requirements that ...
Meeooowwww's user avatar
0 votes
2 answers
78 views

.NET 8.0 NUnit 4.1.0 I have a bunch of tests that are wrapped within a TestFixture attribute. Each test case is currently opening a file as a FileParameter and passing that to a function: string path ...
Ewan's user avatar
  • 816
1 vote
1 answer
635 views

I am trying to speed up the following dlt (dlthub) pipeline via parallelisation as shown in the documentation here: https://dlthub.com/docs/reference/performance#parallelism Here is the original (NOT ...
Elis Evans's user avatar
0 votes
1 answer
101 views

I am new to multiprocessing, so this might be a stupid question. I am using Ubuntu 20.04.6 LTS (64-bit) with a 12th Gen Intel(R) Core(TM) i7-12700K processor and 16GB of RAM under Python 3.9.19. When ...
user17349469's user avatar
1 vote
0 answers
55 views

Does Java parallel stream process all substreams in worker threads, I see the main thread is also used as a worker thread. Sample program: package org.example; import java.util.Arrays; import java....
Jeff Xu's user avatar
  • 31
2 votes
1 answer
474 views

The application that I'm testing has single-user per session enforcement, so I've reduced the number of workers to 1. This has created the problem that my entire Playwright suite takes over an hour to ...
user22068518's user avatar
0 votes
1 answer
106 views

I have the requirement to execute an Azure Function (let's call it OperationalFunction) only once in parallel. What I mean is the following: I have two "entry point" functions: I have a ...
RB_D4S's user avatar
  • 1
0 votes
1 answer
38 views

I have a number of workstations that run long processes containing sequences like this: x = wait_while_current_is_set y = read_voltage z = z + y The workstations must maintain synchronization with a ...
david's user avatar
  • 2,706
2 votes
1 answer
267 views

I wrote some code trying to parallelize my wandb sweeps since the model I am working with takes a long time to converge and I have a lot of subprocesses to sweep through. Basically I don’t have the ...
Leofierus's user avatar
  • 107
0 votes
0 answers
74 views

My "outer" code running under a thread (obviously). That outer code calls Parallel.ForEach and/or Task.Run, and the outer thread also gets used inside those methods. Consider this code: ...
Deane's user avatar
  • 8,815
0 votes
2 answers
340 views

Disclaimer: Somewhat new to deep diving into how the hardware actually executes instructions. Reading "Game Engine Architecture" by Jason Gregory, and I'm on the Implicit & Explicit ...
preets's user avatar
  • 1
2 votes
1 answer
154 views

I'm playing around with kotlin coroutines and now I am testing it on Nilakantha series (formula for calculating Pi). Here is my code: import kotlinx.coroutines.* import kotlin.system.measureTimeMillis ...
k1ng0fTerab1th1a's user avatar
1 vote
2 answers
152 views

The Python code file is provided below. I'm using Python 3.10.12 on a Linux mint 21.3 (in case any of these info are needed). The one with a pool of 2 workers takes more time than the one without any ...
AmirHosein Sadeghimanesh's user avatar
1 vote
0 answers
59 views

The following is my source code for parallel calculation matrix multiplication, which tests the parallel efficiency obtained by the number of 1, 2, 4, 8, 16, and 32 threads, respectively. My Operating ...
Noospic's user avatar
  • 25
-2 votes
1 answer
116 views

I have written a code which works on CPU cores very well. But it's not fast enough for me. I want to run it on CUDA cores and I already tried to write a kernel for the montecarlo part of it and etc. ...
Mohammad Jafari's user avatar
0 votes
1 answer
68 views

I am trying to parallelize a program for finding local maxima using reduction. However, I am encountering a problem during the merging process: the merged array ends up containing exactly two fewer ...
Arman's user avatar
  • 3