Uli's Web Site
[ Zathras.de - Uli's Web Site ]
Other Sites: Stories
Pix
Abi 2000
Stargate: Resurgence
Lost? Site Map!
 
 
     home | blog | moose | programming | articles >> blog

 Archive
 
 Blog Topics
 

15 Most Recent [RSS]

 Less work through Xcode and shell scripts
2011-12-16 @600
 
 iTunesCantComplain released
2011-10-28 @954
 
 Dennis Ritchie deceased
2011-10-13 @359
 
 Thank you, Steve.
2011-10-06 @374
 
 Cocoa Text System everywhere...
2011-03-27 @788
 
 Blog migration
2011-01-29 @520
 
 All you need to know about the Mac keyboard
2010-08-09 @488
 
 Review: Sherlock
2010-07-31 @978
 
 Playing with Objective C on Debian
2010-05-08 @456
 
 Fruit vs. Obst
2010-05-08 @439
 
 Mixed-language ambiguity
2010-04-15 @994
 
 Uli's 12:07 AM Law
2010-04-12 @881
 
 Uli's 1:24 AM Law
2010-04-12 @874
 
 Uli's 6:28 AM Law
2010-04-12 @869
 
 Uli's 3:57 PM Law
2010-04-12 @867
 

More...

Removing transparency from NSImage

[Example of a layered image with boxes around the mostly-transparent images]I played around with a little test project using layer-backed NSImageViews for its graphical display. The hit-testing bit me just when I ran out of time, but a nice little NSImage category came out of this.

It's not very fast, but it takes an NSImage and creates a copy with all the fully transparent areas around it removed. It also gives you a rectangle, indicating at what position (relative to the original upper left of the image) the new image lies.

So if you've created a layered image in Pixelmator, where every object in your scene is on its own layer, you can just export each layer as a full-size PNG, then use this category to cut them down to their actual size and calculate the rectangle the object will need on your screen. Much more fun than doing this manually.

The basic trick is doing a lockFocus on your image, and then looping over the coordinates, doing an NSReadPixel() for each pixel. And there lies the problem: NSReadPixel() creates an NSColor object for each pixel, and tries to interpolate pixels in the image if you end up in a coordinate between them (which I'm sure the code below probably still does).

I tried creating an NSBitmapImageRep with a fixed depth and examining its pixels using -getPixel:atX:y:, but that didn't seem much faster. Copying the whole image and resampling is apparently just as slow as creating a bunch of NSColors. I guess one would have to write code that can work on the actual image data, and is capable of examining any depth, maybe drop down to CGImageRef to get more performance out of this, but for an automated batch preprocessing tool, this is already suitable. Here's the code:

@interface NSImage (UKRemoveTransparentAreas)

-(NSImage*) imageByRemovingTransparentAreasWithFinalRect: (NSRect*)outBox;

@end

@implementation NSImage (UKRemoveTransparentAreas)

-(NSImage*) imageByRemovingTransparentAreasWithFinalRect: (NSRect*)outBox
{
NSRect oldRect = NSZeroRect;
oldRect.size = [self size];
*outBox = oldRect;

[self lockFocus];

// Cut off any empty rows at the bottom:
for( int y = 0; y < oldRect.size.height; y++ )
{
for( int x = 0; x < oldRect.size.width; x++ )
{
NSColor* theCol = NSReadPixel( NSMakePoint( x, y ) );
if( [theCol alphaComponent] > 0.01 )
goto bottomDone;
}

outBox->origin.y += 1;
outBox->size.height -= 1;
}

bottomDone:
// Cut off any empty rows at the top:
for( int y = oldRect.size.height -1; y >= 0; y-- )
{
for( int x = 0; x < oldRect.size.width; x++ )
{
NSColor* theCol = NSReadPixel( NSMakePoint( x, y ) );
if( [theCol alphaComponent] > 0.01 )
goto topDone;
}

outBox->size.height -= 1;
}

topDone:
// Cut off any empty rows at the left:
for( int x = 0; x < oldRect.size.width; x++ )
{
for( int y = 0; y < oldRect.size.height; y++ )
{
NSColor* theCol = NSReadPixel( NSMakePoint( x, y ) );
if( [theCol alphaComponent] > 0.01 )
goto leftDone;
}

outBox->origin.x += 1;
outBox->size.width -= 1;
}

leftDone:
// Cut off any empty rows at the right:
for( int x = oldRect.size.width -1; x >= 0; x-- )
{
for( int y = 0; y < oldRect.size.height; y++ )
{
NSColor* theCol = NSReadPixel( NSMakePoint( x, y ) );
if( [theCol alphaComponent] > 0.01 )
goto rightDone;
}

outBox->size.width -= 1;
}

rightDone:
[self unlockFocus];

// Now create new image with that subsection:
NSImage* returnImg = [[[NSImage alloc] initWithSize: outBox->size] autorelease];
NSRect destBox = *outBox;
destBox.origin = NSZeroPoint;

[returnImg lockFocus];
[self drawInRect: destBox fromRect: *outBox operation: NSCompositeCopy fraction: 1.0];
[[NSColor redColor] set];
[NSBezierPath strokeRect: destBox];
[returnImg unlockFocus];

return returnImg;
}

@end

And yes, I'm using goto, get over it. It just made the code much more readable, and as long as C doesn't add named loops, so I can say break outerloop;, I guess it'll stay that way. It's not as if I was jumping up and down, so code flow is obvious, and NSAutoreleasePools save me from any scope issues goto might have or not.

Another approach I haven't yet tried to make this work in realtime would be to use CoreImage. Maybe one could build a histogram of certain areas of the image (only the alpha channel) and thus quickly home in on big transparent spots. Anyone wanna chime in?

Reader Comments: (RSS Feed)
Peter Hosey writes:
> I tried creating an NSBitmapImageRep with a fixed depth and examining its pixels using -getPixel:atX:y:, but that didn't seem much faster. The fastest way would be to create a CGBitmapContext with a pixel format of your choice, draw the source image into it, iterate directly on its backing buffer to determine the rect, and then use CGBitmapContextCreateImage and CGImageCreateWithImageInRect to crop out the desired image. I don't think you can use Core Image for this. You already have the alpha channel, and there's nothing a CIFilter can do with it that can help you solve the problem, and with no data-dependent loops, CI Filter Language can't solve the complete problem alone. You would have to dynamically generate the CIFL code to (looplessly) find the bounds of the image.
Ken Ferry writes:
Hey Uli! The recommended way to do something like this is to make a new NSBitmapImageRep in a _known_pixel_format_, draw the image into it, then examine the data of that bitmap. This is fast because you aren't churning through making objects or indirecting through pointers or anything like that. This is safe because the drawing machinery is basically canonicalizing your abstract image into a single format you understand - no need to deal specially with arbitrary depths or arbitrary anything. There's a discussion the AppKit release notes under "NSBitmapImageRep: CoreGraphics impedance matching and performance notes". "So, to sum up: (1) Drawing is fast. Playing with pixels is not. (2) If you think you need to play with pixels, (a) consider if there's a way to do it with drawing or (b) look into CoreImage. (3) If you still want to get at the pixels, draw into a bitmap whose format you know and look at those pixels." This is case 3.
Or E-Mail Uli privately.

 
Created: 2010-01-16 @560 Last change: 2025-10-26 @926 | Home | Admin | Edit
© Copyright 2003-2025 by M. Uli Kusterer, all rights reserved.