MMM
Page 1 of 9 1234 ... LastLast
Results 1 to 25 of 217

Thread: PhysX on a CPU likely to see very little benefit from SSE recompile

  1. #1
    Registered User
    Join Date
    Feb 2010
    Location
    NVIDIA HQ
    Posts
    76

    PhysX on a CPU likely to see very little benefit from SSE recompile

    http://scalibq.spaces.live.com/Blog/...0661!387.entry

    The results speak for themselves. Although there certainly is some performance left on the table by nVidia, it is nowhere near as dramatic as Kanter claimed. In synthetic tests, there is about 8% to be gained from recompiling. This is nowhere near the 2-4x figure that Kanter was using. In fact, 8% faster PhysX processing would mean even less than 8% higher framerates in games, since PhysX is not the only CPU-intensive task in a game. Perhaps the net gain in framerate would be closer to 3-4%, depending on the game. In other words, recompiling PhysX with SSE would not make CPUs threaten GPU physics. Not even close. The difference would be lost in the margin of error, most likely.
    NVIDIA Forums Administrator

  2. #2
    Xtreme Addict
    Join Date
    Apr 2007
    Posts
    2,128
    Great job Nvidia PR warrior! Now get lost.

    To anyone interested, the paragraph does NOT talk about recompiling PhysX to take advantage of SSE! It talks about recompiling Bullet physics to take advantage of SSE. It has NOTHING to do with PhysX. Well, still there will be people thinking that somone actually tested PhysX with SSE.

    Considering the pro-Nvidia source, broguth by an Nvidia warrior... Meh, I'll stick to Box2D.
    Last edited by Calmatory; 09-23-2010 at 10:55 AM.

  3. #3
    Xtreme Enthusiast
    Join Date
    Apr 2006
    Posts
    939
    Explain WHY nvidia chose to use x87 in the first place then I might care about this blog. Otherwise it looks like nvidia went way way out of their way to make physx run 8% slower.

  4. #4
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    so much hate in that article. basically a smear campaign against Kanter,

    it also goes on about the history of who own what, and its making us realize that physics in games is trying so hard to get money, instead of building quality.

  5. #5
    Xtreme Addict
    Join Date
    Apr 2007
    Posts
    1,870
    Quote Originally Posted by Iconyu View Post
    Explain WHY nvidia chose to use x87 in the first place then I might care about this blog. Otherwise it looks like nvidia went way way out of their way to make physx run 8% slower.
    Nvidia didn't choose to use x87, they bought Novodex that way. They didn't go out of their way to do anything, they just didn't go out of their way to improve it. At least try to do some research guys before jumping on the Nvidia hating bandwagon

  6. #6
    I am Xtreme
    Join Date
    Jul 2007
    Location
    Austria
    Posts
    5,485
    Quote Originally Posted by Calmatory View Post
    Great job Nvidia PR warrior! Now get lost.

    To anyone interested, the paragraph does NOT talk about recompiling PhysX to take advantage of SSE! It talks about recompiling Bullet physics to take advantage of SSE. It has NOTHING to do with PhysX. Well, still there will be people thinking that somone actually tested PhysX with SSE.

    Considering the pro-Nvidia source, broguth by an Nvidia warrior... Meh, I'll stick to Box2D.
    lol indeed.. read a few posts and realized they where talking about bullet...

  7. #7
    Xtreme X.I.P. Particle's Avatar
    Join Date
    Apr 2008
    Location
    Kansas
    Posts
    3,219
    Hey, if you don't like the taste of French dressing then you don't like ranch either. They're both dressings after all and can thus be treated as if they're the same entity.

    As with my real life example of dressing, the statement may be true. Just because Fact A, though, it isn't necessarily proof that Fact B.
    Particle's First Rule of Online Technical Discussion:
    As a thread about any computer related subject has its length approach infinity, the likelihood and inevitability of a poorly constructed AMD vs. Intel fight also exponentially increases.

    Rule 1A:
    Likewise, the frequency of a car pseudoanalogy to explain a technical concept increases with thread length. This will make many people chuckle, as computer people are rarely knowledgeable about vehicular mechanics.

    Rule 2:
    When confronted with a post that is contrary to what a poster likes, believes, or most often wants to be correct, the poster will pick out only minor details that are largely irrelevant in an attempt to shut out the conflicting idea. The core of the post will be left alone since it isn't easy to contradict what the person is actually saying.

    Rule 2A:
    When a poster cannot properly refute a post they do not like (as described above), the poster will most likely invent fictitious counter-points and/or begin to attack the other's credibility in feeble ways that are dramatic but irrelevant. Do not underestimate this tactic, as in the online world this will sway many observers. Do not forget: Correctness is decided only by what is said last, the most loudly, or with greatest repetition.

    Rule 3:
    When it comes to computer news, 70% of Internet rumors are outright fabricated, 20% are inaccurate enough to simply be discarded, and about 10% are based in reality. Grains of salt--become familiar with them.

    Remember: When debating online, everyone else is ALWAYS wrong if they do not agree with you!

    Random Tip o' the Whatever
    You just can't win. If your product offers feature A instead of B, people will moan how A is stupid and it didn't offer B. If your product offers B instead of A, they'll likewise complain and rant about how anyone's retarded cousin could figure out A is what the market wants.

  8. #8
    Xtreme Addict
    Join Date
    Jul 2004
    Location
    U.S of freakin' A
    Posts
    1,931
    Quote Originally Posted by Iconyu View Post
    Explain WHY nvidia chose to use x87 in the first place then I might care about this blog. Otherwise it looks like nvidia went way way out of their way to make physx run 8% slower.
    Nvidia didn't "choose" to use x87. People often seem to forget that PhysX was created by Ageia, and was designed primarily to run on dedicated hardware.. Software driven PhysX was merely an afterthought on the PC, hence why it wasn't optimized thoroughly unlike other software Physics APIs.

    On consoles however, PhysX runs on the CPU and is optimized for it very well from what I've heard.

    So intent needs to be taken into account here..
    Intel Core i7 6900K
    Noctua NH-D15
    Asus X99A II
    32 GB G.Skill TridentZ @ 3400 CL15 CR1
    NVidia Titan Xp
    Creative Sound BlasterX AE-5
    Sennheiser HD-598
    Samsung 960 Pro 1TB
    Western Digital Raptor 600GB
    Asus 12x Blu-Ray Burner
    Sony Optiarc 24x DVD Burner with NEC chipset
    Antec HCP-1200w Power Supply
    Viewsonic XG2703-GS
    Thermaltake Level 10 GT Snow Edition
    Logitech G502 gaming mouse w/Razer Exact Mat
    Logitech G910 mechanical gaming keyboard
    Windows 8 x64 Pro

  9. #9
    Xtreme Addict
    Join Date
    Apr 2007
    Posts
    2,128
    Great, the blog post states that sometimes x87 comes out on top. Since I don't have any deeper understanding of inner workings of Bullet, nor any real practical experience of SSE.. Is that even possible?

    Can there be cases when SSE comes out slower than x87?

    For example single precision matrix multiplication can be what, 4x faster in SSE than in x87(One can do 4 MULs parallel, right?)? I have hard time believing that any single operation implemented in SSE would be slower than x87 equivialient. The results.. sound fishy at best. Oh, and AVX could potentially double the single precision speed, allowing 8 parallel MULs. ;
    Last edited by Calmatory; 09-23-2010 at 11:18 AM.

  10. #10
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    if Physx had 3 things:
    unlimited multi core support for those with quads, hexi, or 8 threads, on all games
    and
    the ability to adjust intensity with finite detail
    and
    for a dedicated gpu for physx to work on any brands video output (ati for gaming, nvidia for physx)

    Physx would gain ALOT of peoples respect. for those who think that wouldnt help nvidia at all, its not really true. if things look great, and people want more, they would probably go and buy a dedicated card to get the most out of it. but it wouldnt take away the experience completely for the majority of the world and there wouldnt be so much disrespect sent their way. if bullet ever takes off, and as havok does great things since cpus are gaining lots of cores, but mhz are not increasing much, there will be plenty of extra power left on the cpu for havoc. both of those things will threaten their business, and they need to setting the standard, instead of trying to milk the market with laziness.

  11. #11
    Xtreme Addict
    Join Date
    Jul 2004
    Location
    U.S of freakin' A
    Posts
    1,931
    Quote Originally Posted by Calmatory View Post
    Great job Nvidia PR warrior! Now get lost.

    To anyone interested, the paragraph does NOT talk about recompiling PhysX to take advantage of SSE! It talks about recompiling Bullet physics to take advantage of SSE. It has NOTHING to do with PhysX. Well, still there will be people thinking that somone actually tested PhysX with SSE.

    Considering the pro-Nvidia source, broguth by an Nvidia warrior... Meh, I'll stick to Box2D.
    The comparison is still relevent, even if it's not PhysX. The point the author was making, is that the huge gains Kanter talked about in his article from "simply recompiling" is very unrealistic..
    Intel Core i7 6900K
    Noctua NH-D15
    Asus X99A II
    32 GB G.Skill TridentZ @ 3400 CL15 CR1
    NVidia Titan Xp
    Creative Sound BlasterX AE-5
    Sennheiser HD-598
    Samsung 960 Pro 1TB
    Western Digital Raptor 600GB
    Asus 12x Blu-Ray Burner
    Sony Optiarc 24x DVD Burner with NEC chipset
    Antec HCP-1200w Power Supply
    Viewsonic XG2703-GS
    Thermaltake Level 10 GT Snow Edition
    Logitech G502 gaming mouse w/Razer Exact Mat
    Logitech G910 mechanical gaming keyboard
    Windows 8 x64 Pro

  12. #12
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    Quote Originally Posted by Carfax View Post
    The comparison is still relevent, even if it's not PhysX. The point the author was making, is that the huge gains Kanter talked about in his article from "simply recompiling" is very unrealistic..
    to sum the article up:
    one troll attacks another troll, and we get a news post out of it, lol

  13. #13
    Xtreme Addict
    Join Date
    Apr 2007
    Posts
    1,870
    Quote Originally Posted by Manicdan View Post
    if Physx had 3 things:
    unlimited multi core support for those with quads, hexi, or 8 threads, on all games
    and
    the ability to adjust intensity with finite detail
    and
    for a dedicated gpu for physx to work on any brands video output (ati for gaming, nvidia for physx)
    Which other API gives you any one of those things (rhetorical question)? It's funny how PhysX has to be so awesome in order to get respect yet no other API has any of those qualities you just described.

  14. #14
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    Quote Originally Posted by trinibwoy View Post
    Which other API gives you any one of those things (rhetorical question)? It's funny how PhysX has to be so awesome in order to get respect yet no other API has any of those qualities you just described.
    havok is built to work on CPUs and is not overly saturated. look at HL2, just enough to play the game, even on 2ghz. mafia2 has effects that push hardware to the limits just cause smoke is so dense, or everyone and their mother has to wear a cape or trench coat.

    phsyx is known for trying to put too much of something that adds nothing except a higher cost on your electric bill. also their arrogance, like not supporting those who bought a nvidia gpu to work with an ati as their main output. and disabling the older ageia PPU "just because"

    havok is not the same as phsyx at all. both have different histories and different methods for implementation, and should be judged independently

  15. #15
    Xtreme Addict
    Join Date
    Jul 2004
    Location
    U.S of freakin' A
    Posts
    1,931
    Quote Originally Posted by Manicdan View Post
    if Physx had 3 things:
    unlimited multi core support for those with quads, hexi, or 8 threads, on all games
    PhysX already has multicore support, but it's up to the developer to take advantage of it. Metro 2033 already demonstrated this.
    Intel Core i7 6900K
    Noctua NH-D15
    Asus X99A II
    32 GB G.Skill TridentZ @ 3400 CL15 CR1
    NVidia Titan Xp
    Creative Sound BlasterX AE-5
    Sennheiser HD-598
    Samsung 960 Pro 1TB
    Western Digital Raptor 600GB
    Asus 12x Blu-Ray Burner
    Sony Optiarc 24x DVD Burner with NEC chipset
    Antec HCP-1200w Power Supply
    Viewsonic XG2703-GS
    Thermaltake Level 10 GT Snow Edition
    Logitech G502 gaming mouse w/Razer Exact Mat
    Logitech G910 mechanical gaming keyboard
    Windows 8 x64 Pro

  16. #16
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    Quote Originally Posted by Carfax View Post
    PhysX already has multicore support, but it's up to the developer to take advantage of it. Metro 2033 already demonstrated this.
    which is why i said all games.
    im not saying they didnt do it, we already went over that before.
    just saying nvidia needs to take care of it (and the article says they are, which is good)

    i would like to point out that i do not hate nvidia, nor am i actually mad. i am discussing the issues that the general public has run into and the things holding back a feature that should be much "cooler" than it has been.

  17. #17
    Xtreme Addict
    Join Date
    Jul 2004
    Location
    U.S of freakin' A
    Posts
    1,931
    Quote Originally Posted by Manicdan View Post
    mafia2 has effects that push hardware to the limits just cause smoke is so dense, or everyone and their mother has to wear a cape or trench coat.
    I can't believe you're comparing Mafia 2 to HL2

    The former uses scalable HARDWARE driven physics, while the latter uses software physics only.

    Don't tell me that you're complaining that you can't max out the physics on Mafia 2 on your hexcore processor, when the game's physics was designed to be run on a GPU and not a CPU..

    Seriously..
    Intel Core i7 6900K
    Noctua NH-D15
    Asus X99A II
    32 GB G.Skill TridentZ @ 3400 CL15 CR1
    NVidia Titan Xp
    Creative Sound BlasterX AE-5
    Sennheiser HD-598
    Samsung 960 Pro 1TB
    Western Digital Raptor 600GB
    Asus 12x Blu-Ray Burner
    Sony Optiarc 24x DVD Burner with NEC chipset
    Antec HCP-1200w Power Supply
    Viewsonic XG2703-GS
    Thermaltake Level 10 GT Snow Edition
    Logitech G502 gaming mouse w/Razer Exact Mat
    Logitech G910 mechanical gaming keyboard
    Windows 8 x64 Pro

  18. #18
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    Quote Originally Posted by Carfax View Post
    I can't believe you're comparing Mafia 2 to HL2

    The former uses scalable HARDWARE driven physics, while the latter uses software physics only.

    Don't tell me that you're complaining that you can't max out the physics on Mafia 2 on your hexcore processor, when the game's physics was designed to be run on a GPU and not a CPU..

    Seriously..
    you seriously need to read things again. havok is built for CPUs, and Physx is bult for GPUs, we both know this, and we both just acknowledged this.

    you seemed to have ignored the last sentence
    "havok is not the same as phsyx at all. both have different histories and different methods for implementation, and should be judged independently "

  19. #19
    Xtreme Addict
    Join Date
    Apr 2007
    Posts
    1,870
    Quote Originally Posted by Manicdan View Post
    havok is built to work on CPUs and is not overly saturated.
    I really think you're confusing APIs with the use of those APIs. There's nothing inherent about Havok that makes it "not overly saturated". You can easily code Havok routines that bog down your CPU.

    havok is not the same as phsyx at all. both have different histories and different methods for implementation, and should be judged independently
    What does any of that has to do with your laundry list of things PhysX must do but other APIs don't need to, in order to get your approval? You are arbitrarily setting the bar higher for PhysX because that's the only way you can continue to criticize it while ignoring the fact that other APIs are in the same boat.

  20. #20
    Xtreme Enthusiast
    Join Date
    Apr 2006
    Posts
    939
    Quote Originally Posted by trinibwoy View Post
    Nvidia didn't choose to use x87, they bought Novodex that way. They didn't go out of their way to do anything, they just didn't go out of their way to improve it. At least try to do some research guys before jumping on the Nvidia hating bandwagon
    Any programmer would have at least looked at SSE recompile over x87, even at only a 8% gain the fact the only excuse is the guy we bought it from didn't still isn't enough.

    You want to know why I think they won't do it? Because in a couple of years time once AMD and Intel get their 'fusion' act together CPU's will have a GPU (with a much lower latency, and direct use of system memory) on die that will make GPU physics very interesting.

    Next year everyone will be a nvidia fanboy again, nvidia always come back from a ass kicking. But will we be talking Physx or will Intel be showing off a next gen Havok built to fusion type CPU's as another direct attack on nvidia?

  21. #21
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    ^^i never set the bar for Havok, i ONLY discussed the 3 things that i think would let nvidia gain alot of respect from the public.

    and your right that you can make a game that forces Havok to bog down a cpu, but you dont because you know that would be stupid since it cant be supported (unless your goal is to have an ultrahigh setting for future use). however there seems to be a clear hand pushing on developers using physx to do certain things to make the cpu bog. many newer physx games have been constantly identified as bloated. so i suggest that settings should be available to tweak each major piece of the physics (how dense smoke is, how many points are on cloth, should cloth animation be on all users or just main character). let people pick what they like, or let people feel immersed into the game.

    my "laundry list" is just for physx because havok has completely different problems and some might be worse, or just as bad, but still different.

    SUVs roll over easier
    compact cars get completely crushed easily
    should we crash test both identically because they are both automobiles? no, you try to identify the weakest things and fix them.

  22. #22
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    darn thing double posted

  23. #23
    Xtreme Addict
    Join Date
    Apr 2007
    Posts
    1,870
    Quote Originally Posted by Iconyu View Post
    Any programmer would have at least looked at SSE recompile over x87, even at only a 8% gain the fact the only excuse is the guy we bought it from didn't still isn't enough.
    When there's some real analysis on the cost and effort required to improve PhysX performance on the CPU then let's talk. Pulling out a random 8% number out of thin air doesn't work in the real world. For argument's sake let's say there is a magical 8% performance increase and that PhysX is 100% of the CPU workload. So those 15fps numbers become 16fps...hoorah!!!

    For the run of the mill stuff that Havok does, PhysX performs just fine. It's only when you try to run the more advanced GPU enabled stuff on the CPU that it struggles. But let's not let those simple facts get in the way of Nvidia bashing.

  24. #24
    Banned
    Join Date
    Jan 2003
    Location
    EU
    Posts
    318
    Well, nvidia, if you cant program , open source it.I bet few folks would make it work well in a matter of months.
    But wait, you wont do that because ?

    Because you cant cripple non nvidia systems that way.got ya.

  25. #25
    Xtremely Kool
    Join Date
    Jul 2006
    Location
    UK
    Posts
    1,875
    Quote Originally Posted by trinibwoy View Post
    I really think you're confusing APIs with the use of those APIs. There's nothing inherent about Havok that makes it "not overly saturated". You can easily code Havok routines that bog down your CPU.
    Indeed, but its funny that its nearly impossible for developers to write Physx routines that don't bog down the GPU.

Page 1 of 9 1234 ... LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •