Data Structures and Algorithms in Go

Overview

Algorithms

Simple Sorts

Selection Sort

Sort a list of []Type by repeatedly searching for the smallest (or largest) element of the list, removing the found element from the list, and appending it to a new list.

Time Complexity: $O(n^2)$

Example
func selectionSort(list []int) []int {
	newList := []int{}
	for range list {
		smallest, smallestIndex := findSmallest(list)
		// remove the found element from the original list
		list = append(list[:smallestIndex], list[smallestIndex+1:]...)
		newList = append(newList, smallest)
	}
	return newList
}

func findSmallest(list []int) (int, int) {
	smallest, smallestIndex := list[0], 0
	for i, number := range list {
		if number < smallest {
			smallest = number
			smallestIndex = i
		}
	}
	return smallest, smallestIndex
}

Bubble Sort


Start with the last item in the array and compare it to the item before it. If the item is smaller than the one before it, swap the two. Repeating this pass over the array, the smallest values are "bubbled up" to the front.

Time Complexity: $O(n^2)$

Example
func bubbleSort(list []int) []int {
	for start := 0; start < len(list)-1; start++ {
		for end := len(list) - 1; end > start; end-- {
			if list[end] < list[end-1] {
				list[end-1], list[end] = list[end], list[end-1]
			}
		}
	}
	return list
}

Insertion Sort

Scan each element in the array, inserting it into its proper place with respect to the already-sorted elements before it.

Time Complexity: $O(n^2)$

Example
func insertionSort(list []int) []int {
	for i := 1; i < len(list); i++ {
		for j := i; j > 0 && list[j-1] > list[j]; j-- {
			list[j], list[j-1] = list[j-1], list[j]
		}
	}
	return list
}

$O(n\log n)$ Sorts

Quick Sort

A divide-and-conquer sorting algorithm that uses recursion. To perform a quick sort:

  1. Select a pivot element in the array
  2. Iterate through the array, moving every element greater than the pivot to one side and every element less than the pivot to the other side
  3. Recursively perform the pivot selection and partitioning until the resulting sub-arrays are either empty or contain 1 element

The choice of pivot can affect the time complexity; the best case is when both sub-arrays are of equal size.

Time Complexity:
average: $O(n\log n)$
worst: $O(n^2)$

Example
func quicksort(list []int) []int {
	if len(list) < 2 {
		return list
	}
	pivot := list[len(list)/2]
	left, right, middle := []int{}, []int{}, []int{}
	for _, value := range list {
		if value < pivot {
			left = append(left, value)
		} else if value > pivot {
			right = append(right, value)
		} else {
			middle = append(middle, value)
		}
	}
	return append(append(quicksort(left), middle...), quicksort(right)...)
}
Example using in-place quickSort

The in-place variant sorts values within the original slice while scanning for elements less than the pivot, avoiding the extra slice allocations of the version above. The partition below uses the Lomuto scheme, which tends to require more swaps and comparisons than Hoare's partition scheme but is simpler to implement.

func main() {
	list := []int{5, 1, 2, 18, 3}
	quickSort(list, 0, len(list)-1)
	fmt.Println(list)
}

// quickSort sorts list in place between the low and high indexes.
func quickSort(list []int, low, high int) {
	if low < high {
		pivot := partition(list, low, high)
		quickSort(list, low, pivot-1)
		quickSort(list, pivot+1, high)
	}
}

// partition picks the last element as the pivot, moves smaller
// elements before it, and returns the pivot's final index.
func partition(list []int, low, high int) int {
	pivot := list[high]
	i := low
	for j := low; j < high; j++ {
		if list[j] < pivot {
			list[i], list[j] = list[j], list[i]
			i++
		}
	}
	list[i], list[high] = list[high], list[i]
	return i
}

Merge Sort

Merge sort is another divide-and-conquer sorting method, like quicksort. To sort, cut the array into 2 sub-arrays and recursively call mergeSort on each until the sub-arrays have a length less than 2, then merge the sorted results back together.

Time Complexity: $O(n\log n)$

Example
func mergeSort(list []int) []int {
	if len(list) < 2 {
		return list
	}

	mid := len(list) / 2
	left := mergeSort(list[:mid])
	right := mergeSort(list[mid:])
	return merge(left, right)
}

func merge(leftArray []int, rightArray []int) []int {
	tempArray := []int{}
	i, j := 0, 0
	for i < len(leftArray) && j < len(rightArray) {
		if leftArray[i] < rightArray[j] {
			tempArray = append(tempArray, leftArray[i])
			i++
		} else {
			tempArray = append(tempArray, rightArray[j])
			j++
		}
	}

	// if one array was longer than the other, copy the remainder
	for ; i < len(leftArray); i++ {
		tempArray = append(tempArray, leftArray[i])
	}
	for ; j < len(rightArray); j++ {
		tempArray = append(tempArray, rightArray[j])
	}
	return tempArray
}

Searching

Binary search searches a sorted list by bisecting it after every lookup. Each comparison halves the remaining search space, so its advantage over a linear scan grows with the size of the list (it runs in logarithmic time).

Time Complexity: $O(\log n)$

Example

func binarySearch(list []int, item int) int {
	low := 0
	high := len(list) - 1
	for low <= high {
		// integer division truncates, so mid rounds down
		mid := (low + high) / 2
		if item == list[mid] {
			return mid
		} else if item < list[mid] {
			high = mid - 1
		} else {
			low = mid + 1
		}
	}
	return -1
}

Data Structures

Arrays, Slices, and LinkedLists

Arrays are a collection of values stored in sequential memory. In Go, arrays are of a fixed length and cannot be resized. More often, Go developers interact with the slice type, with arrays acting as the underlying building block for slices.

testArray := [4]int{2, 4, 6, 8}

for i := range testArray {
	fmt.Printf("testArray[%d] memory address: %v\n", i, &testArray[i])
}

// each element sits 8 bytes (one int) after the previous:
// testArray[0] memory address: 0xc00011a040
// testArray[1] memory address: 0xc00011a048
// testArray[2] memory address: 0xc00011a050
// testArray[3] memory address: 0xc00011a058

Linked lists are a collection of values AND a pointer to the next value contained within the list. They are distinct from arrays in that they are not stored in contiguous memory. As a result, appending a new item to the list does not require copying the entire list to a new data structure.

The trade-offs between these two structures should become apparent quickly:

  • Arrays are better for reading items, as they allow random access, whereas linked lists require sequential access
  • Linked lists more easily allow insertions and deletions in the middle of the list, as only one pointer has to be updated
  • Arrays require more memory in reserve, as they cannot be resized after initialization, whereas linked lists only require enough additional memory to store each new value
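The trade-offs above can be sketched with a minimal singly linked list (the `node` and `list` types here are illustrative, not from the standard library; Go's `container/list` package provides a production doubly linked list):

```go
package main

import "fmt"

// node holds a value and a pointer to the next node in the list.
type node struct {
	value int
	next  *node
}

// list tracks the head and tail so appends are O(1): no copying required.
type list struct {
	head, tail *node
}

// append adds a value to the end of the list by updating a single pointer.
func (l *list) append(value int) {
	n := &node{value: value}
	if l.head == nil {
		l.head = n
	} else {
		l.tail.next = n
	}
	l.tail = n
}

// get walks the list sequentially; unlike an array, there is no random access.
func (l *list) get(i int) int {
	n := l.head
	for ; i > 0; i-- {
		n = n.next
	}
	return n.value
}

func main() {
	l := &list{}
	for _, v := range []int{2, 4, 6, 8} {
		l.append(v)
	}
	fmt.Println(l.get(2)) // 6
}
```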

Hash Tables

Hash tables (sometimes called hash maps, maps, dictionaries, or associative arrays) are arrays of key-value pairs that use a hash function to map each key to an array index in a consistent, deterministic way. Hash functions accept byte data and return an index. Hash tables allow data lookups in $O(1)$ average time complexity, making them one of the most powerful data structures in computer science.

The quality of the hash function determines the effectiveness of the underlying data structure. A good hash function:

  • consistently maps the same data to the same index
  • maps different data to different indexes
  • is aware of the underlying array's size and only returns valid array indexes
  • minimizes the number of collisions (returning the same index for multiple values)

Because of these capabilities, hash tables are extremely effective at:

  • performing lookups
  • modeling relationships from one thing to another
  • checking for duplicate entries in a given set of data
  • acting as a cache

Collisions

As mentioned above, hash tables should avoid collisions as much as possible. But what if the data set is larger than the underlying array, or the hash function must by its nature return duplicate indexes? One such example: sorting names in an employee directory, using a hash table where the first letter of the last name is the hash. Of course there would be multiple entries for some letters.

To accommodate this, while still maintaining some of the benefits of hash tables, a linked list can be used to store multiple entries at a single hash index. This is not the only solution, but it is a common pattern.
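This chaining pattern can be sketched as follows (the `hashTable` type and its character-sum `hash` function are illustrative assumptions, not a production design; each bucket holds a linked chain of entries that collided on the same index):

```go
package main

import "fmt"

// entry is one key/value pair in a bucket's collision chain.
type entry struct {
	key   string
	value int
	next  *entry
}

// hashTable resolves collisions by chaining entries at each bucket index.
type hashTable struct {
	buckets []*entry
}

func newHashTable(size int) *hashTable {
	return &hashTable{buckets: make([]*entry, size)}
}

// hash sums the key's bytes and wraps the result so it is always
// a valid bucket index (a toy hash function, not a production one).
func (h *hashTable) hash(key string) int {
	sum := 0
	for _, c := range key {
		sum += int(c)
	}
	return sum % len(h.buckets)
}

// set inserts or updates a key, walking the chain on collision.
func (h *hashTable) set(key string, value int) {
	i := h.hash(key)
	for e := h.buckets[i]; e != nil; e = e.next {
		if e.key == key {
			e.value = value
			return
		}
	}
	h.buckets[i] = &entry{key: key, value: value, next: h.buckets[i]}
}

// get returns the value for a key and whether it was present.
func (h *hashTable) get(key string) (int, bool) {
	for e := h.buckets[h.hash(key)]; e != nil; e = e.next {
		if e.key == key {
			return e.value, true
		}
	}
	return 0, false
}

func main() {
	h := newHashTable(4) // a tiny table makes collisions likely
	h.set("alice", 1)
	h.set("bob", 2)
	h.set("carol", 3)
	if v, ok := h.get("bob"); ok {
		fmt.Println(v) // 2
	}
}
```

Go's built-in `map` type already does all of this; the sketch only shows how chaining keeps lookups working when indexes collide.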

A comparison of the above data structures and their time complexity:

|        | Hash Maps (Worst) | Hash Maps (Average) | Arrays | Linked Lists |
|--------|-------------------|---------------------|--------|--------------|
| Search | $O(n)$            | $O(1)$              | $O(1)$ | $O(n)$       |
| Insert | $O(n)$            | $O(1)$              | $O(n)$ | $O(1)$       |
| Delete | $O(n)$            | $O(1)$              | $O(n)$ | $O(1)$       |

Tree data structures

A tree is a nonlinear structure (unlike a list or queue) in which each node is capable of having many successor nodes, called children.

  • Trees are recursive structures. Every node must have a unique parent, with the exception of the root node.
    • Another way of thinking of this: there must be a unique path to each and every node.

Binary trees

A binary tree is a tree where each node is capable of having up to 2 children.

If a binary tree node has no children, it is called a leaf.

Binary search trees

A unique feature of binary search trees is how data is added to the tree.

When adding data to a binary search tree:

  • nodes with smaller data than the root node are inserted in the left subtree
  • nodes with larger data than the root node are inserted in the right subtree

There are 3 traversal patterns for a binary search tree:

  • preorder traversal - visit the root, then the left subtree, then the right subtree
  • inorder traversal - visit the left subtree, then the root, then the right subtree
    • visits nodes in order from smallest to largest
  • postorder traversal - visit the left subtree, then the right subtree, then the root
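The insertion rule and the inorder traversal can be sketched together (the `treeNode` type and helper functions are illustrative assumptions; note that inorder traversal yields the values in sorted order):

```go
package main

import "fmt"

// treeNode is a binary search tree node: smaller values go left, larger right.
type treeNode struct {
	value       int
	left, right *treeNode
}

// insert places a value in the correct subtree and returns the (possibly new) root.
func insert(root *treeNode, value int) *treeNode {
	if root == nil {
		return &treeNode{value: value}
	}
	if value < root.value {
		root.left = insert(root.left, value)
	} else {
		root.right = insert(root.right, value)
	}
	return root
}

// inorder visits the left subtree, then the root, then the right subtree,
// appending values from smallest to largest.
func inorder(root *treeNode, out *[]int) {
	if root == nil {
		return
	}
	inorder(root.left, out)
	*out = append(*out, root.value)
	inorder(root.right, out)
}

func main() {
	var root *treeNode
	for _, v := range []int{5, 1, 18, 2, 3} {
		root = insert(root, v)
	}
	sorted := []int{}
	inorder(root, &sorted)
	fmt.Println(sorted) // [1 2 3 5 18]
}
```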