
New build for development


Comments

  • Registered Users Posts: 3,965 ✭✭✭mp3guy


    I would definitely go Intel on the CPU. For vector instructions, the latencies and clocks are often better on Intel because, well, it's Intel's instruction set. If you can afford it, get a CPU that supports AVX-512 for the best bleeding-edge performance.


  • Moderators, Society & Culture Moderators Posts: 15,750 Mod ✭✭✭✭smacl


    mp3guy wrote: »
    I would definitely go Intel on the CPU. For vector instructions, the latencies and clocks are often better on Intel because, well, it's Intel's instruction set. If you can afford it, get a CPU that supports AVX-512 for the best bleeding-edge performance.

    Arguably better short-term performance, but the plan is to drop in a 16-core/32-thread 3950X next year without any other mods needed to the system. VS2017 basically compiles one translation unit per available thread as separate processes, so AVX-512 can't help much here. The other issue with newer SIMD extensions is that I'm developing commercial software that needs to run on most hardware, so I can't actually use them in delivered products unless I'm putting a customer-specific build together.
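
    For the "needs to run on most hardware" constraint, the usual workaround rather than a customer-specific build is runtime dispatch: check the instruction set once at startup and pick a code path. A minimal C++ sketch, assuming MSVC on x86 (the HasAvx2 helper name is just for illustration):

        #include <intrin.h>   // MSVC intrinsics: __cpuidex, _xgetbv

        // Returns true if both the CPU and the OS support AVX2, so a generic
        // build can select a vectorized code path at runtime.
        bool HasAvx2()
        {
            int regs[4];
            __cpuidex(regs, 7, 0);                            // leaf 7, subleaf 0
            const bool cpuAvx2 = (regs[1] & (1 << 5)) != 0;   // EBX bit 5 = AVX2

            __cpuidex(regs, 1, 0);
            const bool osxsave = (regs[2] & (1 << 27)) != 0;  // ECX bit 27 = OSXSAVE
            if (!cpuAvx2 || !osxsave)
                return false;

            // The OS must also save/restore the YMM registers (XCR0 bits 1 and 2).
            return (_xgetbv(0) & 0x6) == 0x6;
        }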


  • Registered Users Posts: 3,965 ✭✭✭mp3guy


    smacl wrote: »
    Arguably better short-term performance, but the plan is to drop in a 16-core/32-thread 3950X next year without any other mods needed to the system. VS2017 basically compiles one translation unit per available thread as separate processes, so AVX-512 can't help much here. The other issue with newer SIMD extensions is that I'm developing commercial software that needs to run on most hardware, so I can't actually use them in delivered products unless I'm putting a customer-specific build together.

    The Ryzen 3000 series will still only have AVX2, so the high-end Intel CPUs will have double the vector FLOPS (16 single-precision lanes per 512-bit register versus 8 with AVX2). Half the cores on an Intel CPU with AVX-512 will crush an AMD CPU with just AVX2.

    Not sure what you're trying to say about VS2017: https://devblogs.microsoft.com/cppblog/microsoft-visual-studio-2017-supports-intel-avx-512/

    Your end point really hits the nail on the head as to why you'd stick with AMD, but then that conflicts with your desire for vendor-specific GPGPU via CUDA.


  • Moderators, Society & Culture Moderators Posts: 15,750 Mod ✭✭✭✭smacl


    mp3guy wrote: »
    The Ryzen 3000 series will still only have AVX2, so the high-end Intel CPUs will have double the vector FLOPS (16 single-precision lanes per 512-bit register versus 8 with AVX2). Half the cores on an Intel CPU with AVX-512 will crush an AMD CPU with just AVX2.
    Not sure what you're trying to say about VS2017: https://devblogs.microsoft.com/cppblog/microsoft-visual-studio-2017-supports-intel-avx-512/

    The C++ compiler itself uses one single-threaded process for each file it is compiling, so 16 threads means 16 source files compiling at a time. SIMD (single instruction, multiple data) doesn't help this as we're talking multiple instructions, multiple data across multiple processes. This is also true of any code I write myself, where the SIMD benefits come largely from compiler optimizations rather than explicit code, whereas the multi-threaded optimizations come from the code. Typically, a piece of code that would benefit from explicit SIMD/AVX code would be a good candidate for porting to the GPU. For the same number of threads, Intel is faster, so if I wasn't going to upgrade an i9-9900 would be a better bet, but the AMD gives me the option to jump to 32 threads for ~€600 next year, where there's no sign of this being an option for Intel.
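
    As a concrete illustration of leaning on the compiler rather than explicit SIMD, a loop like this is a typical auto-vectorization candidate (a minimal sketch; built with MSVC /O2 /arch:AVX2, and /Qvec-report:2 will report whether it was vectorized):

        // The compiler may turn this scalar loop into AVX instructions on its
        // own; no intrinsics or assembly in the source.
        void Scale(float* dst, const float* src, float k, int n)
        {
            for (int i = 0; i < n; ++i)
                dst[i] = src[i] * k;
        }
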
    mp3guy wrote: »
    Your end point really hits the nail on the head as to why you'd stick with AMD, but then that conflicts with your desire for vendor-specific GPGPU via CUDA.

    The CUDA stuff can be ported onto AMD GPUs with HIP/hipify, but it's extra work I don't want in my development cycle. It's good enough to go through this once the code is developed and test on another box. It's also the case in my industry that people commonly spec nVidia cards for CUDA apps but also use Ryzen and Threadripper.
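
    For anyone curious what the HIP side looks like, the port is mostly mechanical renaming of the runtime API (hipify-perl does it textually). A rough host-side sketch, with a made-up AllocAndUpload helper just for illustration:

        #include <hip/hip_runtime.h>   // the CUDA original would include <cuda_runtime.h>
        #include <cstddef>

        // After hipify, cudaMalloc/cudaMemcpy become their hip* equivalents 1:1.
        float* AllocAndUpload(const float* host, std::size_t count)
        {
            float* dev = nullptr;
            hipMalloc(reinterpret_cast<void**>(&dev), count * sizeof(float));   // was cudaMalloc
            hipMemcpy(dev, host, count * sizeof(float),
                      hipMemcpyHostToDevice);                                   // was cudaMemcpy
            return dev;
        }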


  • Registered Users Posts: 3,965 ✭✭✭mp3guy


    smacl wrote: »
    The C++ compiler itself uses one single-threaded process for each file it is compiling, so 16 threads means 16 source files compiling at a time. SIMD (single instruction, multiple data) doesn't help this as we're talking multiple instructions, multiple data across multiple processes. This is also true of any code I write myself, where the SIMD benefits come largely from compiler optimizations rather than explicit code, whereas the multi-threaded optimizations come from the code. Typically, a piece of code that would benefit from explicit SIMD/AVX code would be a good candidate for porting to the GPU. For the same number of threads, Intel is faster, so if I wasn't going to upgrade an i9-9900 would be a better bet, but the AMD gives me the option to jump to 32 threads for ~€600 next year, where there's no sign of this being an option for Intel.

    Oh, compilation of code versus execution. Sure, vector instructions aren't very useful for compilation, but at runtime they make all the difference. You can just write using intrinsics if you want to leverage the SIMD instructions without writing assembly; then you don't have to worry about the compiler being too dumb to do it automatically, but you still get the register allocation for free. Porting to the GPU would only make sense if the GPU is not required for something else, e.g. rendering in a game loop. But I get where you're coming from: you want fast compilation, and that's where many cores (and M.2 SSDs in RAID 0) make all the difference (as long as you don't also have a lot of linking to do).
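
    A minimal intrinsics sketch of that, for an 8-wide AVX scaling loop (function name is just for illustration; compile with /arch:AVX or -mavx):

        #include <immintrin.h>   // AVX intrinsics

        // Explicit SIMD without assembly: the compiler still does the
        // register allocation for the __m256 values.
        void ScaleAvx(float* dst, const float* src, float k, int n)
        {
            const __m256 vk = _mm256_set1_ps(k);
            int i = 0;
            for (; i + 8 <= n; i += 8)
            {
                __m256 v = _mm256_loadu_ps(src + i);              // load 8 floats (unaligned ok)
                _mm256_storeu_ps(dst + i, _mm256_mul_ps(v, vk));  // multiply and store
            }
            for (; i < n; ++i)                                    // scalar tail
                dst[i] = src[i] * k;
        }
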
    smacl wrote: »
    The CUDA stuff can be ported onto AMD GPUs with HIP/hipify, but it's extra work I don't want in my development cycle. It's good enough to go through this once the code is developed and test on another box. It's also the case in my industry that people commonly spec nVidia cards for CUDA apps but also use Ryzen and Threadripper.

    I typically never trust automatic converters like that; the fact of the matter is NVIDIA pumped resources into CUDA to sell GPUs while OpenCL and AMD rotted in a corner with just Compute Shaders and the like. Much better developer experience with CUDA, plus more neat features.


  • Moderators, Society & Culture Moderators Posts: 15,750 Mod ✭✭✭✭smacl


    mp3guy wrote: »
    Oh, compilation of code versus execution. Sure, vector instructions aren't very useful for compilation, but at runtime they make all the difference. You can just write using intrinsics if you want to leverage the SIMD instructions without writing assembly; then you don't have to worry about the compiler being too dumb to do it automatically, but you still get the register allocation for free. Porting to the GPU would only make sense if the GPU is not required for something else, e.g. rendering in a game loop. But I get where you're coming from: you want fast compilation, and that's where many cores (and M.2 SSDs in RAID 0) make all the difference (as long as you don't also have a lot of linking to do).

    A second M.2 might be worth considering there, and linking is also a plus for Intel, though it tends not to be an issue in my typical development cycle. The point about the GPU is well made. While it won't typically be rendering for me, mixing GPU work and multi-threading can lead to GPU resource issues, where SIMD won't.
    I typically never trust automatic converters like that; the fact of the matter is NVIDIA pumped resources into CUDA to sell GPUs while OpenCL and AMD rotted in a corner with just Compute Shaders and the like. Much better developer experience with CUDA, plus more neat features.

    Same, I've been down that road once with OpenCL, which seems to be dying a death, and similarly with AMP. The Direct3D compute shader is my current choice for Windows apps, but it's no good for Unix or Mac. CUDA also has a bunch of good library code out there, which makes it attractive and has me leaning towards nVidia for dev at least.
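
    On the library point, as an example of why the CUDA ecosystem is attractive, something like Thrust gives you GPU algorithms with no hand-written kernels. A small sketch, assuming a .cu file built with nvcc:

        #include <thrust/device_vector.h>
        #include <thrust/sort.h>
        #include <thrust/copy.h>
        #include <vector>

        // Sort a host vector on the GPU: copy up, sort with thrust::sort,
        // copy the result back down.
        void SortOnGpu(std::vector<float>& values)
        {
            thrust::device_vector<float> d(values.begin(), values.end());
            thrust::sort(d.begin(), d.end());
            thrust::copy(d.begin(), d.end(), values.begin());
        }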

