Traditional cube detecting graphics project

(website version: https://pklito.github.io/Graphics-Project/)
This project's goal is to recognize and locate cubes in images with traditional edge detecting methods and graphics fundamentals. In these images, the position and orientation of the camera isn't known, however the FOV is constant.

There are several different approaches I explored in order to achieve decent results:

Finding faces and using Perspective N-Point solvers.

The first method consisted of constructing a planar graph out of the edges of the image, finding cube faces by searching for loops in the graph, applying the PNP solver on these faces, and consolidating the results.

Finding the vanishing points of the image

I've explored modelling the vanishing points for an image of a voxel world (grid aligned cubes) in order to find the camera parameters, and then applying the previous face localizing method, but also finding the distance and angle to each detected edge manually.

A graphics engine based in modernGL was also used in order to generate simple testing images for this problem.

Read the writeup below for more in-detail explanations about the process.

Writeup

Read here the full writeup

Video

https://youtu.be/lJQKC-Njt0E

Installation

The project uses python 3. To install the libraries, enter the main directory and run

pip install -r requirements.txt or alternatively pip install pygame opencv-python numpy moderngl matplotlib

Running the code

There are two seperate runnable files, the first is the simulator that can generate cubes for you. The second is the prediction that takes an image and guesses where the cubes are.

Graphics simulator

py run_simulation.py

Controls: ESC - exit P - Pause WASD - move around SPACE/SHIFT - move down/up Mouse - turn camera T - export current frame as 'generated_images/output.png' Y - Print camera matrix and cube positions G - Process the current frame directly

The area has some randomly placed cubes I used to test out different configurations for the image. Pressing G does the LSD graph method of finding cubes without having to run run_prediction.py. Note: To continue, you may have to close the result MatPlotLib provides.

Predictor

py run_prediction.py <image_name> [-d DETECTOR] [-v]

This script processes an image to detect cubes using different detection methods.

Arguments:

image_name: The name of the image file to process.
-d, --detector: The edge detection method to use. Options are lsd (default), hough.
-p, --pipeline Which analysis will be run on the image.
- For the first faces and solvePNP solution, graph
- For the method which localizes edges individually using the vanishing points, lines
- For the mixture, which gets the edges from the vanishing points and uses solvePNP with faces of those edges, use mixed
-v, --verbose: If set, enables verbose mode for detailed images along with the result.

Example usages:

py run_prediction.py generated_images/demo_scarce.png

py run_prediction.py generated_images/demo_scarce.png --detector lsd --verbose --pipeline graph

py run_prediction.py generated_images/demo_scarce.png --pipeline lines

If no image is provided, the script defaults to generated_images/demo_scarce.png with the lsd detector and non-verbose mode.

Detection Methods:

lsd: Line Segment Detector.
prob / hough / houghp: Alias for Probabilistic Hough Transform.

Name		Name	Last commit message	Last commit date
Latest commit History 216 Commits
docs		docs
generated_images		generated_images
objects		objects
shaders		shaders
textures		textures
.gitignore		.gitignore
camera.py		camera.py
constants.json		constants.json
constants.py		constants.py
containers.py		containers.py
cube.png		cube.png
debug.txt		debug.txt
debug2.json		debug2.json
debug2.txt		debug2.txt
debug_parse.py		debug_parse.py
debug_parse_json.py		debug_parse_json.py
graph.py		graph.py
install.txt		install.txt
logger.py		logger.py
matrix_to_angles.py		matrix_to_angles.py
model.py		model.py
opencv.py		opencv.py
opencv_fit.py		opencv_fit.py
opencv_fit_color.py		opencv_fit_color.py
opencv_points.py		opencv_points.py
opencv_renewed.py		opencv_renewed.py
readme.md		readme.md
requirements.txt		requirements.txt
run_prediction.py		run_prediction.py
run_simulation.py		run_simulation.py
sc_rgb.txt		sc_rgb.txt
star.png		star.png
test.py		test.py
test4.png		test4.png
test_rvec.py		test_rvec.py
texture.py		texture.py
util.py		util.py
vbo.py		vbo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Traditional cube detecting graphics project

Finding faces and using Perspective N-Point solvers.

Finding the vanishing points of the image

Writeup

Video

Installation

Running the code

Graphics simulator

Predictor

Predictor

Arguments:

Example usages:

Detection Methods:

Previews:

About

Languages

pklito/Graphics-Project

Folders and files

Latest commit

History

Repository files navigation

Traditional cube detecting graphics project

Finding faces and using Perspective N-Point solvers.

Finding the vanishing points of the image

Writeup

Video

Installation

Running the code

Graphics simulator

Predictor

Predictor

Arguments:

Example usages:

Detection Methods:

Previews:

About

Topics

Resources

Stars

Watchers

Forks

Languages