Loving Coding & Visual Design
Reading
About Pass-by-Value
http://javadude.com/articles/passbyvalue.htm
http://www.roguelazer.com/2015/02/beating-the-compiler/
On a Haswell processor, which has two "Vector Int ALU"s, you can theoretically be doing two AVX2 operations in parallel, each of which can be operating on eight 32-bit integers at once. According to the Intel 64 and IA32 Architectures Optimization Reference Manual, each of these instructions has a latency of 1 cycle, giving us a maximum theoretical PPC of 16.0, which is quite a lot. In fact, it's impossibly much. Transferring 16 32-bit words to the CPU per cycle would require 64 bytes/cycle of memory throughput, which translates on my CPU to 147GB/s of memory throughput. The PC3-12800 RAM in my system can do a maximum of 12.8GB/s per stick (25.6GBps total), which is only 11 bytes per cycle. That gives us a maximum throughput from memory of slightly above 2.75 PPC. Anything above that is probably an artifact of the lovely 128MB of eDRAM L4 cache on my CPU.
http://karpathy.github.io/neuralnets/
https://babeljs.io/docs/learn-es6/
http://javadude.com/articles/passbyvalue.htm
http://www.roguelazer.com/2015/02/beating-the-compiler/
On a Haswell processor, which has two "Vector Int ALU"s, you can theoretically be doing two AVX2 operations in parallel, each of which can be operating on eight 32-bit integers at once. According to the Intel 64 and IA32 Architectures Optimization Reference Manual, each of these instructions has a latency of 1 cycle, giving us a maximum theoretical PPC of 16.0, which is quite a lot. In fact, it's impossibly much. Transferring 16 32-bit words to the CPU per cycle would require 64 bytes/cycle of memory throughput, which translates on my CPU to 147GB/s of memory throughput. The PC3-12800 RAM in my system can do a maximum of 12.8GB/s per stick (25.6GBps total), which is only 11 bytes per cycle. That gives us a maximum throughput from memory of slightly above 2.75 PPC. Anything above that is probably an artifact of the lovely 128MB of eDRAM L4 cache on my CPU.
http://karpathy.github.io/neuralnets/
https://babeljs.io/docs/learn-es6/
最 近 文 章
- 写Adsense Blog的人居然分不清中文和日文 - Tue, 12 Sep 2006 01:12:49 +0000
- 记住这一天,是为了让它不再发生! - Mon, 11 Sep 2006 06:05:08 +0000
- 想成为百万富翁?先思考这三个问题。 - Thu, 07 Sep 2006 09:43:34 +0000
- Google的中美办公室对比 - Wed, 06 Sep 2006 05:12:48 +0000
- Apach1.3.31+PHP4.3.3安装出错:can't load of php4apache.dll - Tue, 05 Sep 2006 16:12:32 +0000
- 广州暂住人员如何申请港澳游 - Sat, 02 Sep 2006 16:25:54 +0000
- CSS样例:英文首字母特大写 - Fri, 25 Aug 2006 05:06:15 +0000
- 微软说:任何人都可以为XBOX开发游戏 - Wed, 23 Aug 2006 05:35:01 +0000
- 来自kiko.com的互联网创业经验 - Tue, 22 Aug 2006 05:17:16 +0000
- Google工具栏的安全浏览功能 - Thu, 17 Aug 2006 08:57:48 +0000