Current version

v1.10.4 (stable)

Navigation

Main page
Archived news
Downloads
Documentation
   Capture
   Compiling
   Processing
   Crashes
Features
Filters
Plugin SDK
Knowledge base
Contact info
Forum
 
Other projects
   Altirra

Search

Archives

01 Dec - 31 Dec 2013
01 Oct - 31 Oct 2013
01 Aug - 31 Aug 2013
01 May - 31 May 2013
01 Mar - 31 Mar 2013
01 Feb - 29 Feb 2013
01 Dec - 31 Dec 2012
01 Nov - 30 Nov 2012
01 Oct - 31 Oct 2012
01 Sep - 30 Sep 2012
01 Aug - 31 Aug 2012
01 June - 30 June 2012
01 May - 31 May 2012
01 Apr - 30 Apr 2012
01 Dec - 31 Dec 2011
01 Nov - 30 Nov 2011
01 Oct - 31 Oct 2011
01 Sep - 30 Sep 2011
01 Aug - 31 Aug 2011
01 Jul - 31 Jul 2011
01 June - 30 June 2011
01 May - 31 May 2011
01 Apr - 30 Apr 2011
01 Mar - 31 Mar 2011
01 Feb - 29 Feb 2011
01 Jan - 31 Jan 2011
01 Dec - 31 Dec 2010
01 Nov - 30 Nov 2010
01 Oct - 31 Oct 2010
01 Sep - 30 Sep 2010
01 Aug - 31 Aug 2010
01 Jul - 31 Jul 2010
01 June - 30 June 2010
01 May - 31 May 2010
01 Apr - 30 Apr 2010
01 Mar - 31 Mar 2010
01 Feb - 29 Feb 2010
01 Jan - 31 Jan 2010
01 Dec - 31 Dec 2009
01 Nov - 30 Nov 2009
01 Oct - 31 Oct 2009
01 Sep - 30 Sep 2009
01 Aug - 31 Aug 2009
01 Jul - 31 Jul 2009
01 June - 30 June 2009
01 May - 31 May 2009
01 Apr - 30 Apr 2009
01 Mar - 31 Mar 2009
01 Feb - 29 Feb 2009
01 Jan - 31 Jan 2009
01 Dec - 31 Dec 2008
01 Nov - 30 Nov 2008
01 Oct - 31 Oct 2008
01 Sep - 30 Sep 2008
01 Aug - 31 Aug 2008
01 Jul - 31 Jul 2008
01 June - 30 June 2008
01 May - 31 May 2008
01 Apr - 30 Apr 2008
01 Mar - 31 Mar 2008
01 Feb - 29 Feb 2008
01 Jan - 31 Jan 2008
01 Dec - 31 Dec 2007
01 Nov - 30 Nov 2007
01 Oct - 31 Oct 2007
01 Sep - 30 Sep 2007
01 Aug - 31 Aug 2007
01 Jul - 31 Jul 2007
01 June - 30 June 2007
01 May - 31 May 2007
01 Apr - 30 Apr 2007
01 Mar - 31 Mar 2007
01 Feb - 29 Feb 2007
01 Jan - 31 Jan 2007
01 Dec - 31 Dec 2006
01 Nov - 30 Nov 2006
01 Oct - 31 Oct 2006
01 Sep - 30 Sep 2006
01 Aug - 31 Aug 2006
01 Jul - 31 Jul 2006
01 June - 30 June 2006
01 May - 31 May 2006
01 Apr - 30 Apr 2006
01 Mar - 31 Mar 2006
01 Feb - 29 Feb 2006
01 Jan - 31 Jan 2006
01 Dec - 31 Dec 2005
01 Nov - 30 Nov 2005
01 Oct - 31 Oct 2005
01 Sep - 30 Sep 2005
01 Aug - 31 Aug 2005
01 Jul - 31 Jul 2005
01 June - 30 June 2005
01 May - 31 May 2005
01 Apr - 30 Apr 2005
01 Mar - 31 Mar 2005
01 Feb - 29 Feb 2005
01 Jan - 31 Jan 2005
01 Dec - 31 Dec 2004
01 Nov - 30 Nov 2004
01 Oct - 31 Oct 2004
01 Sep - 30 Sep 2004
01 Aug - 31 Aug 2004

Stuff

Powered by Pivot  
XML: RSS feed 
XML: Atom feed 

§ Problem with SSE4.1 support

I have the day off from work today, so I decided to sit down and try some SSE4.1 experiments... only to discover the following:

Sigh.

It's not a dealbreaker, since using MASM macros isn't bad and I'm used to RDTSC profiling, but sheesh... talk about missing prerequisites.

Comments

Comments posted:


1. There is VTune 9.0+ available
4. It can (SSSE3) if you use macros from Agner Fog [http://www.agner.org/optimize/macros.zip]
6. AFAIK you have latest MASM in Windows Server 2008 SDK

As for missing prerequisites, hopefully it won't be the same with AVX:
http://softwarecommunity.intel.com/isn/d..

:-)

Igor Levicki (link) - 22 04 08 - 09:45


I've used VTune 9.0... aside from the fact that it would cost me $700 to get a personal license for that version, I get the impression that it sucks. It kept crashing or giving profiles with half the symbols resolved that were mysteriously fixed by a re-link. For some reason you also can't set the initial columns and it keeps coming up with useless data like segment address, which is lame.

Unfortunately, Agner doesn't have SSE4.1 macros. I took a shot at doing them and came to the conclusion that they'd be difficult to implement cleanly for certain instructions, most notably PMOVZXBQ, due to the lack of similar instructions.

I'd rather not require the Windows Server 2008 SDK, since you have to hack it in order to get a clean build with VS2005 SP1.

As for AVX... I'd love to play around with that, since two address instructions suuuuuuck, but they have to release a CPU with it first and then I'd have to get one. I didn't realize the AVX reference was out, but looking at it.... yeesh. I mean, the stuff they're adding is cool, but the x86 mess just got even bigger. You've now got a 256-bit load called VMOVDQA (note that the DQA part originally meant double quadword aligned or 128-bit), and absurdly long instruction mnemonics like VBROADCASTF128. I'm just glad my disassembler is parser based because Intel just made the x86 instruction stream more annoying with the addition of a VEX prefix.

Phaeron - 23 04 08 - 01:22


oops, i mistakingly posted this on a newer blog post, so here goes again:

ID software had a similar problem (needed sse3 support before their compilers had it). They solved it with clever use of regular VC macros:

http://www.koders.com/cpp/fidDF1DB7A4FCE..

OfekSH - 23 04 08 - 09:43


Hey, AVX looks interesting, I could kill for 256 bit wide regs and fp madd :P

Gabest - 23 04 08 - 20:15


Me too. Soon they will release software emulator for AVX.

Igor Levicki (link) - 27 04 08 - 14:59


"MASM 9.0 is included in Visual C++ 2008 SP1 Express Edition."
http://connect.microsoft.com/VisualStudi..

VC++ 2008 SP1 link: http://www.microsoft.com/downloads/detai..

JuanR - 31 01 09 - 12:55

Comment form


Please keep comments on-topic for this entry. If you have unrelated comments about VirtualDub, the forum is a better place to post them.
Name:  
Remember personal info?

Email (Optional):
Your email address is only revealed to the blog owner and is not shown to the public.
URL (Optional):
Comment: /

An authentication dialog may appear when you click Post Comment. Simply type in "post" as the user and "now" as the password. I have had to do this to stop automated comment spam.



Small print: All html tags except <b> and <i> will be removed from your comment. You can make links by just typing the url or mail-address.