I'm using this approach to store data in a global array hosting an http server where certain requests will manipulate the global array.
I'm kind of worried about running into threading issues with certain operations -- mainly push
and splice
. I imagine if one request has me iterating over the array and removing items based on a conditional, while another request has me calling .push()
on the array that I'll run into issues. Can anyone confirm this?
I mostly write in C# where even a simple increment isn't thread safe (launching 25 threads that do i++, won't guarantee that i == 25 after all is said and done).
Update:
I've written 5 examples to demonstrate what I'm talking about. Test 1 and Test 3 work fine. Test 2 fails, because of well... what normally would be called threading issues (whether they are actual CPU threads or not). Test 4 and 5, when run in parallel seem to work (meaning they don't have collision problems like Test 2).
I'm using ApacheBench to test, making 1000 parallel requests.
This leads me to believe that Test 1 and Test 3 work fine because nodejs won't execute more than 1 instance of the javascript function in parallel (blocking?). As soon as you implement a setInterval/setTimeout, it frees up nodejs to execute another instance of the callback (non-blocking?).app.get('/test3'...)
callback in parallel
I'm really just trying to understand what the heck non-blocking I/O model
really means. Does it mean that "hey it's possible to do non-blocking with setTimeout and setInterval if you need non-blocking, otherwise we're going to block any other outer-level functions from being run until we exhaust the function we're on"? I feel it's imperative to know this so that I don't get myself into trouble thinking I could implement something like /test2 and be totally safe.
Also if I'm trying to be non-blocking with my callbacks, should I really be calling setTimeout(code, 1)
? Or is there a better way?
All are thread safe.
There are no threads, JavaScript is single threaded, it's impossible for two javascript statements to run at the same time.
As an aside, you shouldn't be using globals anyway because globals are evil
Edit:
Test 2 fails because you're using asynchronous callbacks, which means control goes back to node and it can handle more requests. This creates race conditions as seen.
In node, anything that's not asynchronous blocks. The only asynchronous things you have are setTimeout/setInterval/process.nextTick and any asynchronous IO operations.
One shouldn't manually make computation asychronous. One should just avoid doing too much computation.
I've written an article about What it means to be non-blocking
Raynos is mostly correct. But just because JavaScript is single threaded doesn't mean you can let your guards down. You still need to be aware of threads. A good example to illustrate this is how socket.io is maintaining connection with varies clients and still be able to keep track of these clients.
JavaScript's functional programming part works by variable binding (like...A LOT). Not being aware of threads makes you falsely assumes that your variables are the correctly bound. Like the socket.io example, how can you be sure the connection you're getting is the client you think you're getting? What if the reference binding went wrong and the connection actually references a different client?
This is the type of things you need to be careful of.