The example looks very good. Do you have more images to share? I think more examples would be nice to show off more of what it can handle. Different room types, interiors etc.
Also in that regards: I'm curious about what it can't handle. Any situations where it borks?
Excellent suggestion. Will find time tomorrow to add a `/gallery` page. Created an issue to track: https://github.com/fill3d/fill/issues/1 . Best first issue :D
Amazing! The inserted objects are renders of textured 3D models and not generated by a diffusion model + ControlNet? Is there a fixed set of textured 3D models available or are they generated on the fly based on the prompt?
That's correct! Right now, we're using the BlenderKit catalog, but we can expand beyond it. When you type a prompt and search though, that's actually doing a multi-modal search (so you can ask for a 'red painting' and it'll actually find a red painting), so it's far more accurate than a regular keyword search. AI everywhere!
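For the curious, the gist of a multi-modal search like that can be sketched in a few lines: embed the text query and the catalog's asset images into a shared vector space, then rank by cosine similarity. Everything below (the catalog, the embeddings, `embed_text`) is made up purely for illustration; a real system would use a joint text-image model such as CLIP.

```python
import numpy as np

# Hypothetical catalog: each 3D asset has a precomputed image embedding.
# These 4-d vectors are fake; real ones would come from an image encoder.
catalog = {
    "red painting": np.array([0.9, 0.1, 0.0, 0.1]),
    "blue sofa":    np.array([0.1, 0.8, 0.3, 0.0]),
    "wooden table": np.array([0.0, 0.2, 0.9, 0.2]),
}

def embed_text(prompt: str) -> np.ndarray:
    # Stand-in for a text encoder; a real implementation would call the
    # model's text tower. Here we fake an embedding near "red painting".
    fake = {"red painting": np.array([0.88, 0.12, 0.05, 0.08])}
    return fake[prompt]

def search(prompt: str) -> str:
    """Return the catalog item whose embedding is closest to the prompt."""
    q = embed_text(prompt)
    q = q / np.linalg.norm(q)
    def score(v: np.ndarray) -> float:
        return float(q @ (v / np.linalg.norm(v)))  # cosine similarity
    return max(catalog, key=lambda name: score(catalog[name]))

print(search("red painting"))  # → red painting
```

The point is that the query never has to match the asset's name; the match happens in embedding space, which is why "red painting" can retrieve a painting that actually looks red.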
Super cool! Layout estimation for deprojection is a GNARLY problem especially because people love white textureless walls.
Tried some on fill3D from a dataset we had before (happy to share more), and yup: https://imgur.com/a/Ut2GwZ0
Tough Tough Tough!
fxn.ai looks super cool too, I might try it out!
Would love to get hands on that dataset, how can I reach you? Or, shoot me a note at [email protected]
My use case for this would be for decorating my apartment.
I’ve got a big empty studio with a bed and couch I’ve already purchased but trying to figure out what to fill in for all the other gaps. Coffee table, media console, tv or UST projector, bar or bookshelf or desk.
Would be nice if there was a way to populate it with items/products that can be purchased and aren’t purely conceptual.
Yup, this is actually a roadmap feature. Because we generate in 3D, users can bring their own 3D models and add them to the catalog. And if you add something like Object Capture from Apple (https://developer.apple.com/augmented-reality/object-capture...), you could literally scan your couch, upload it to Fill 3D, place, and generate.
Exciting times ahead.
I’ve not tried it yet, but came across this site the other day which meets your use case: https://aihomedesign.com
(No affiliation!)
Have real estate companies considered leaving a house unfurnished and letting potential buyers put on AR goggles to see what it would look like with their furniture?
Or could just use a phone/tablet as a "viewport". I know it wouldn't be as immersive but the barrier to have it adopted would be a lot lower.
This level of realism seems impossible in AR as of today, if path tracing a single frame takes a minute or more.
What are you thinking is your business model? I'm a sysadmin at a small MLS, trying to figure out where we'd integrate it. At $2/stage it's something we'd probably have to have you bill the Realtor directly for (I don't think we do any pass-through billing), but could maybe include a couple stages per month per Realtor. I could see a fun use-case where consumers would be able to do their own staging, but there are probably few if any Realtors that will be willing to pay $2/stage for consumers to do that.
Would love to have a proper convo on this. With bulk pricing, I can reduce the price by quite a lot. Eventually, the goal is to have users be able to stage themselves in your property website or MLS. Please shoot me a note at [email protected] !
Will it work with decks and porches?
I have images of decks and porches that need staging for the construction company's web site.
I tried the demo, it seems to be buggy and it seems to only allow you to choose existing items from a predefined db.
What bugs did you encounter? And yes, because we're using actual 3D models, there's a fixed set of models (right now, just under 300). Because the priority is ultra-realism, the current state-of-the-art for 3D model diffusion won't cut it (see OpenAI Point-e https://github.com/openai/point-e).
So you only project the background into a 3D model, and the foreground is not generated, but rather composed of 3D models?
The bug I saw: after uploading a background image, on the right side I only saw a Generate and a Reset button, nothing else. I clicked "Generate", expecting it to ask me to input a prompt, but it started to render, and the result was the same background I uploaded.
Yes that's correct. And for using it, you have to draw rectangles before you can add a prompt, similar to Photoshop's generative fill UI. Check out the video on the landing page. Lmk if you face further issues, and sorry about the lacking instructions (I'm not a great webdev).
Maybe show a warning when no rectangles are drawn; I kinda wasted one credit by rendering an empty background.
Pretty awesome for a first Show HN. Multimodal search is very fascinating. I am using an SDXL + LoRA model over here: https://news.ycombinator.com/item?id=37696033
Thank you! The audience is definitely highly technical, so this has been a very productive thread.
Now create a bunch of perspectives, and NeRF or Gaussian splat that, and you've got a fully immersive 3D scene that is better than any rendering.
Why is it better than any rendering?
In this case it’s likely not. The advantage of Gaussian splats is that it allows you to bake in advanced lighting effects for a static scene. If you already have detailed renders there are plenty of existing approaches that perform plenty well and can be far more optimized.
Cos it's immersive (and interactive). Check out this realtime demo of 3DGS in Unity by Aras P (co-founder of Unity): https://www.youtube.com/watch?v=0vS3yh908TU&ab_channel=ArasP...
Are they just saying a 3D scene is better than a 2D rendering? I can't help but think a realtime 3D render could be just as good and probably better.
It looks like a cloud-only app. If it doesn't run entirely locally, it's useless to me. Shipping my data to an external data processor is a security risk I'm not allowed to take.
That's fine. Path tracing in a browser is pretty impractical today anyway. Check back in a few years, when WebGPU is much more mature.
Is there any way to remove objects from an initial image, so that then it can be utilized for staging?
Not right now, but that's a great roadmap feature. It should be trivial with today's model (object detection + inpainting). Created an issue: https://github.com/fill3d/fill/issues/2
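As a sketch of that detection + inpainting idea (not Fill 3D's actual code; `detector` and `inpaint` in the comments are hypothetical stand-ins), the glue step is turning the detector's bounding boxes into a binary mask for the inpainting model:

```python
import numpy as np

def boxes_to_mask(shape, boxes, pad=8):
    """Turn detector bounding boxes into a binary inpainting mask.

    `shape` is (height, width); each box is (x0, y0, x1, y1) in pixels.
    A small `pad` dilates the mask so object shadows and edges get
    inpainted away along with the object itself.
    """
    h, w = shape
    mask = np.zeros((h, w), dtype=np.uint8)
    for x0, y0, x1, y1 in boxes:
        mask[max(y0 - pad, 0):min(y1 + pad, h),
             max(x0 - pad, 0):min(x1 + pad, w)] = 255
    return mask

# A real pipeline would feed this mask to an inpainting model, roughly:
#   boxes = detector(image)                              # any object detector
#   clean = inpaint(image, boxes_to_mask(image.shape[:2], boxes))

mask = boxes_to_mask((100, 100), [(20, 30, 60, 70)])
print(mask.sum() // 255)  # → 3136 masked pixels
```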
Love the project, great work! Can you think about adding some ethical clauses to your license? Something to allow people to use it for good, wholesome purposes, but to avoid letting it be used by scammers faking Airbnb listings, for example.
If someone is willing to scam people on Airbnb, I'm pretty sure they're willing to break a software license.
So that's a good reason to provide people who have no capacity to create fake images an instant way to do so, while riding on the back of things the owner has no idea how they would actually create if they were asked to do so? Sweet. Let's all steal other people's property, charge for APIs and then take $15 a hit to let scammers use it.
Yes, they'll break a software license and use garbage that uses garbage that uses garbage. Way to draw the line.
I'll be honest, I don't really get your reply. I was merely saying that adding a clause in the license is pretty pointless. The hypothetical user has already decided to break one (or more) law(s), they wouldn't even think twice to break a software license (probably won't even read it).
Your comment sounds like a criticism of the project in general rather than of the pointlessness of adding a clause to the license. Personally, I think this is pretty novel, better than the hundreds of stable-diffusion-as-a-service things that have popped up lately.
> while riding on the back of things the owner has no idea how they would actually create if they were asked to do so
I mean, everyone builds on top of things they couldn't recreate. If you're a software developer, chances are you couldn't recreate your favorite language's runtime/compiler/whatever, you couldn't recreate your OS, you couldn't recreate the hardware that's running your software. I don't get this criticism at all.
Wouldn't that qualify as a crime already? That sounds like fraud to me.
"someone took the bed out"
FWIW this isn't my problem with this project. It's that the writer doesn't know what they're doing and represents a new type of post-code/post-crypto monkey that just links together APIs in clever ways and tries to charge maximum $ for it by selling it to people (monkeys?) who think it's magic.
People like this will make a lot of money, and eventually do something that injures you and your family personally. So it's best to attack them and slander them early and often.
> virtual staging in real estate media

If you can make this work with exteriors, landscaping design is huge. Maybe start with something simple like desert landscaping (which is really just rocks, turf, pavers, maybe small palm trees).
What did you use to create the screencast at https://www.fill3d.ai/?
I'm curious about that too. Recently, I've seen many screencasts in the same style, and I hate them. The constant movement of the recorded area is quite distracting.
Could you speak more to the "deprojection" step? What is that?
Fill 3D takes a different approach than diffusion, in that it tries to build an actual 3D scene (kinda like a clone) of what's in the image you upload. In some sense, that's actually the most fundamental representation of what's in your image (or said another way, your image is just a projection of that original scene).
So it works by trying to estimate a 3D 'room' that matches your image. Everything from the geometry, to the light fixtures, to the windows. It's heavily inspired by how humans (weird to contrast 'human' vs. AI work) do image/video compositing.
TL;DR: Image in, 3D scene out.
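To make the "image in, 3D scene out" idea concrete, here's the geometric core in toy form. This is not Fill 3D's actual method (full layout estimation is much harder, as the thread notes); it only shows, under a simple pinhole-camera assumption with made-up intrinsics, how a pixel plus a depth becomes a 3D point, and projects back:

```python
import numpy as np

def unproject(u, v, depth, fx, fy, cx, cy):
    """Lift pixel (u, v) at the given depth into camera-space XYZ."""
    x = (u - cx) / fx * depth
    y = (v - cy) / fy * depth
    return np.array([x, y, depth])

def project(p, fx, fy, cx, cy):
    """Project a camera-space point back to pixel coordinates."""
    x, y, z = p
    return (fx * x / z + cx, fy * y / z + cy)

# Round-trip a pixel through 3D and back (intrinsics chosen arbitrarily).
p = unproject(400, 300, 2.5, fx=600, fy=600, cx=320, cy=240)
u, v = project(p, fx=600, fy=600, cx=320, cy=240)
print(round(u), round(v))  # → 400 300
```

The hard part of deprojection is everything this sketch assumes away: estimating the depths, the camera intrinsics, the wall/floor planes, and the light sources from a single photo.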
Could you elaborate on how that's done technically? I'm curious how you estimate the 3D room. Are you using ML based estimation like LayoutNet? How about the lighting?
You realise this is the role of entire teams at certain companies, right? If you automate enough parts, you'd be able to automate the work of 30 people per company doing this. Not the first to work this out either.
https://investor.wayfair.com/news/news-details/2023/Wayfair-...
Decorify from Wayfair is also using diffusion, same as the other folks who have built similar things in the market (InteriorAI is probably leading product here). We'll see where this goes :D
Can this be used to replace objects in a scene? In your demo example you place a bed, but what if I want to replace my bed with yours?
I like it, but you should add some free tier to test it out.
Nice! Like your landing page.
How well does it work on non-room images?
Depends on the image. Right now, the very first stage (deprojecting the image to 3D) makes assumptions about the image having the structure of a room: large empty floor plan; roughly polygonal geometry.
For different kinds of images, it's a question of using other cues to build a 3D structure that's very close to the original image. And no, monocular depth estimation isn't enough (happy to nerd out about why) ;)
Very cool! The challenge now is filling spaces with different lighting, e.g. sunlight entering through a window in a mostly dark room while a lamp illuminates a wall.
I think this isn't too difficult of a problem. Technically, the objects that get added could be emissive. It could even be a switch, having an added light be on or off.
Wow, nice. I hope you charge realtors a fat price for this
Really can't generate the objects I need to place. A few that don't work: 1. terrarium, 2. fish tank, 3. bunk bed.
Between Fill3D's architecture that 'path traces to render ultra-realistic results' and fxn.ai transparent deployment capability... I gotta say this is super impressive work. I can use both in a current project, and will be investigating.
> Right now, you need an image of an empty room
I needed an image of an empty room recently. I just took a photo of my very not empty room, ran it through a canny algorithm, painted out the objects with black, and then used stable diffusion with canny controlnet to generate an empty room. Worked pretty well. Did not look that much like the original room, but it was certainly good enough to check furniture placement etc.
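That workflow is easy to sketch. The paint-out step is plain array masking; the Canny + ControlNet calls are left as comments since they need the actual models on a GPU (the checkpoint names are the commonly used public ones, an assumption on my part, not something specific to this commenter's setup):

```python
import numpy as np

def paint_out(image: np.ndarray, boxes) -> np.ndarray:
    """Black out rectangular regions so the edge map ignores the furniture."""
    out = image.copy()
    for x0, y0, x1, y1 in boxes:
        out[y0:y1, x0:x1] = 0
    return out

# Remaining steps, roughly (requires opencv-python, diffusers, and a GPU):
#   edges = cv2.Canny(paint_out(img, furniture_boxes), 100, 200)
#   controlnet = ControlNetModel.from_pretrained(
#       "lllyasviel/sd-controlnet-canny")
#   pipe = StableDiffusionControlNetPipeline.from_pretrained(
#       "runwayml/stable-diffusion-v1-5", controlnet=controlnet)
#   empty_room = pipe("an empty room, bare walls", image=edges).images[0]

room = np.full((4, 4, 3), 200, dtype=np.uint8)  # tiny dummy "photo"
print(paint_out(room, [(1, 1, 3, 3)])[2, 2].tolist())  # → [0, 0, 0]
```

As the commenter notes, the result won't match the original room exactly, since the generator only sees the surviving edges, but it's good enough for checking furniture placement.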
This kind of stuff is the future of film making.
Imagine adding "yourself" into a scene like this, moving around as you were/are from a video you just created of yourself. As in: film yourself walking around your bedroom with your phone. Then use an app like this to add you and your movement (cropped from the video) to a different background scene.
Goodbye, Hollywood elites!
I couldn't agree more! You should check out the amazing work from the folks at Luma Labs (https://lumalabs.ai/). They're a loose inspiration for this project.
This is an excellent example of a full pipeline from blender <> luma tools <> 3D in ShapesXR (who are also doing amazing work atm)
https://twitter.com/GabRoXR/status/1706691466460836333?t=3z7...
This is what I’m working on at https://skyglass.com. You should check it out!
Hey, I'd love to chat with you about how you power these on-device AI features (like background replacement). Function is building infrastructure for both server-side and on-device AI inference.
The goal is for devs like you to bring your original Python code, and we'll generate a library that is cached and runs on-device. See this demo: https://demos.natml.ai/@natml/blazepalm-landmark (wave your hands)
> This kind of stuff is the future of film making.
> Imagine adding "yourself" into a scene like this, moving around as you were/are from a video you just created of yourself. As in: film yourself walking around your bedroom with your phone. Then use an app like this to add you and your movement (cropped from the video) to a different background scene.
> Goodbye, Hollywood elites!
As someone working in this arena, statements like this make me chuckle.
Don't get me wrong-- I think this is really cool. I get that people are excited about new tech, and technical people always overestimate the value of technical advancements in creative workflows, but no: people being able to place a perfect hyper-realistic replica of themselves in a film wouldn't kill the film industry any more than RPG Maker + generative AI would kill the games industry. I'd wager it probably would not even leave a dent.
Firstly, there would need to be a film to begin with, and that requires a lot. A whole hell of a lot.
Secondly, characters matter. A lot. Especially main characters. Do you replace the name of the main character with your name in stories you tell? How about doing a global search-and-replace in the ebooks you read? It's not like we don't have the technical capability. I could see this being a novelty feature in some action movies, especially superhero movies, and more likely in games and porn, but one of the biggest draws a movie has is who stars in it-- and if you look at the rest of the human population, you'll notice that we're not choosing representative samples. The fact that it's someone else with a personality and back story and motives and strengths and flaws-- a character-- is a pretty important part of stories. Their appearance matters, too. Most people don't even like staring at themselves for a few minutes, let alone for an entire feature length film. In most situations, I think it would be distracting as hell. Sure, people might find it amusing to see themselves in Top Gun Maverick, but would they want to see themselves getting bullied by an IRS agent in Everything Everywhere All at Once? Getting a box cutter held to their throat in Emily the Criminal? Would replacing Jon Hamm's appearance with their own really make watching Madmen better? Do most people want to see themselves beat to a pulp in Fight Club? I'd wager that few would.
Thirdly, most people aren't particularly interested in putting in the thought and effort to customize their phones: I'm pretty sure they're less interested in putting thought and effort into customizing their passive entertainment. They just want to hit play and have a nice little escape.
So, no. As long as people will continue to seek entertainment for the reasons they've always sought it, this is not going to fundamentally change the art of storytelling anytime soon.
That's an awfully brown colored pink bed in the demo :)
The tech itself looks amazing though, well done.
Love the "No need for nasty YAMLs or Dockerfiles" copy on the Function website. Plus ça change, plus c'est la même chose. HTMX, SQLite, Postgres are hip. Building giant supercomputers is back in, fuck the edge. Even starting to see a new XML wave.
Today I watched a video about gravel-bike touring where some young whippersnapper was getting mad excited about the idea of putting a rack and panniers on the back of their bike, just like in the good old days. What a world we live in. I'm 100% old af.
Very brave to show us that ugly brown bed generated from the prompt "pink bed".
I gasped. This is what will make it trivial to simply highlight a person's swimwear and tell the AI to remove it.
Have you never used stable diffusion?
Today, as in right now, with fewer than 5 relatively not-horrible photographs, you can create a realistic AI version of anyone and have them do, or wear, anything you'd like. Animation included. From your home computer.
Or just inpaint the clothes away from any image.
Look, far from being morally offended by that, I can relate. In '92 - like 6th grade for me - a friend got one of those hand-held scanners for a Mac LE that let you drag it very slowly across the page and get a 300 DPI image into Photoshop v1. And we completely proceeded - as 12 year olds - to paste the faces of girls from our yearbook onto GIF files of porn actresses. That we downloaded at 2400 baud from BBS's.
That's what was going on in 1992.
I'm a little alarmed at the general laziness / lack of initiative of these kids today, TBH. but whatever.
This is a story of some 12-year-old kids doing it to each other in Spain. Another reason why kids should be off social media until they get a driver's license and parents own all data before they turn 18.
> Another reason why kids should be off social media until they get a driver's license and parents own all data before they turn 18.
This has nothing to do with social media. The images have circulated through WhatsApp and Telegram channels. And if they hadn’t, they would have through email or MMSes.
Growing up in the age of generative AI is at least as big a sea change as growing up in the age of social media, or the internet, etc.
I am impressed by the tech, but appalled by the possibilities.
Where I live, it is already common practice for real estate 'agents' to photoshop the properties listed for sale to make them look fully renovated and furnished. When in reality the house is empty and in very bad shape.
This tech will make it even harder to judge a property without actually viewing it in real life.
I think we can no longer stop tech like this from being used in ads (because that's effectively what property listings are nowadays). The only solution I think is policies/laws that prevent real-estate marketplaces from showing fake pictures.
That all said, I think the author can make big money from realtors by selling this tech as a subscription model.
I think we already have laws around misrepresenting things for sale... As far as furnishings: that's definitely spelled out in the contracts for what is included.
I'm sure it varies area to area, but the biggest thing I see in our area is things like adding sunsets in the windows or behind the property photos, but we wouldn't necessarily know if a Realtor had photoshopped out mold or water damage or the like.
Just like with '* Serving suggestion' pictures on food packaging, can't they just do '* Decoration suggestion' to shield themselves from pictorial misrepresentation charges?
They've had staging photoshop forever.
True, virtual staging is a very established product in the real estate media market.