Ganpati Bappa Goes Digital: Creating an AR Ganesha Idol with Generative AI

Augmented reality Ganesha idol generated by AI, displayed on a smartphone screen overlaid on a home environment, showcasing a blend of technology and tradition for Ganesh Chaturthi

Imagine welcoming Lord Ganesha into your home through your smartphone screen. This Ganesh Chaturthi, I combined devotion with technology to create an augmented reality Ganesha idol using only AI tools. Here’s how I did it.

I’ve been itching to work on a small project over the weekend, and with Ganesh Chaturthi upon us, I thought, “Why not show my love for Lord Ganesha in an AI-trending way?” So, I embarked on a journey to generate an AR Experience of Lord Ganesha at my home, all using Generative AI.
Here’s my workflow:

Step 1: Crafting the Perfect Ganesha Image

I started by creating multiple images of the Ganesha Idol. I used the same prompt on both Leonardo and Midjourney:

“Highly detailed origami Ganesha idol, hyper realistic on a white background”

Hyper-realistic AI-generated Ganesha idol with an origami-inspired design, created using Midjourney, on a clean white background

AI-generated origami-style Ganesha idol created using Leonardo AI, showing intricate paper-like folds on a white background

After comparing the results, I settled for the Midjourney version. It seemed to capture the essence I was looking for.

Detailed AI-generated Ganesha idol with an origami aesthetic, featuring intricate folds and a serene expression, chosen for the AR project

Step 2: Transforming 2D to 3D

To bring Ganesha to life, I needed to convert the 2D image into a 3D model. I headed to Hugging Face and found two promising tools:

SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement (https://huggingface.co/spaces/stabilityai/stable-fast-3d)
InstantMesh Image to 3D (https://huggingface.co/spaces/jkorstad/InstantMesh-img-to-3D)

Screenshot of the SF3D tool interface on Hugging Face, showing the 2D to 3D conversion process of the Ganesha idol

Screenshot of the InstantMesh Image to 3D tool on Hugging Face, displaying the 3D mesh generation of the Ganesha idol

After experimenting with both, SF3D won me over. It provided better mesh quality and offered an option to add environment lighting, which illuminated the 3D Model beautifully.

Step 3: Exporting for AR

I exported the 3D model to GLB format. Why GLB? It’s a 3D file format that’s perfect for augmented reality (AR) and web applications, supporting motion and animation.

Step 4: Creating an Interactive Web Experience

Now came the challenge of generating an HTML page to embed the 3D GLB file and create an interactive experience. I turned to Claude 3.5 Sonnet for help, as I’ve found it particularly adept at coding-related tasks compared to ChatGPT 4o.

My prompt for Claude was:

“I have a .glb file that I want to show in an HTML browser and interact with it. The code should ideally ask to activate my camera and request me to place the object on a surface visible in the camera feed. You can refer to AR code at https://modelviewer.dev/. Please add a feature to let me scale the 3D object as well.”

Screenshot of the conversation with Claude AI, showing the prompt for generating HTML code for the AR experience

Step 5: Bringing It All Together

With the code from Claude in hand, I made a few tweaks:

Downloaded the file
Uploaded it to my server
Edited the HTML to point to my GLB file
Added my own title

And voilà! The result is live at: https://abhilashkrishnan.com/innovation/webar/Ganesha/

That’s all there is to it! Happy Ganesh Chaturthi to everyone. May Bappa bless you all with joy, luck, and prosperity.

Have you experimented with AR or AI for festive celebrations? I’d love to hear about your experiences in the comments below!

Update: Exploring Another 3D Conversion Tool

After completing this project, I stumbled upon another fascinating tool for converting 2D images to 3D models: Meshy.ai (https://www.meshy.ai/workspace/image-to-3d). This discovery opened up an interesting comparison and some thought-provoking questions about AI-generated 3D models.

The unique aspect of Meshy.ai is that its AI model seems to modify the image by applying its own creative interpretation during the 3D generation process. While the resulting 3D elements are refined, they don’t exactly match the original image and appear creatively modified.

You can experience and compare the 2nd AR version here: https://abhilashkrishnan.com/innovation/webar/Ganesha/v2

When you explore both versions, you’ll notice distinct characteristics:

The original version using SF3D stays truer to the 2D image.
The new version using Meshy.ai introduces some creative interpretations in the 3D rendering.

This comparison highlights an intriguing aspect of AI-generated 3D models: the balance between accuracy and creative interpretation. Depending on your project goals, you might prefer one approach over the other.

It raises some interesting questions:

In AI-assisted art and modeling, how important is strict adherence to the original image?
Can AI’s creative interpretations enhance the final product in some cases?
How do we balance human artistic intent with AI’s generative capabilities?

I’d love to hear your thoughts on this! Have you experimented with different 2D to 3D conversion tools? What has been your experience with AI’s creative interpretations in 3D modeling?

Feel free to share your experiences or opinions in the comments below. Your insights could be valuable for anyone exploring the exciting intersection of AI, AR, and creative expression!

Cancel reply

The Nano Banana Revolution: Google’s Secret Weapon for Mobile Advertising

Cross-Device Segmentation vs. Single-Device Targeting

Generative AI in Video Ads: Key Use Cases