Skip to content

VascoSch92/data_structure

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

60 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Data Structure


A collection of data structures for Python.

Overview

Data Structure Description Type
Linked List Linear data structure where elements, called nodes, are stored in a sequence. Linked List Type
Double Linked List Linked list where each node has two references: one for the next node and one for the previous node. Linked List Type
Stack Linear data structure that follows the Last In, First Out (LIFO) principle. Stack Type
MinMaxStack Regular stack where you can access the min and max of the elements presents in the stack in constant time. Stack Type
Queue Linear data structure that follows the First In, First Out (FIFO) principle. Queue Type
MaxPriorityQueue Data structure that maintains a collection of elements where each element has an associated priority value. Queue Type
MinPriorityQueue Data structure that maintains a collection of elements where each element has an associated priority value. Queue Type
BinaryTree A binary tree is nothing but a tree where each node has at maximum 2 children. Tree Type
BinarySearchTree A binary tree with special properties on the node values. Tree Type
Trie Tree-like data structure used to store a dynamic set of strings. Trie Type
MatchTrie Regular trie, where you can search words with matching. Trie Type
HashSet Data structure that stores a collection of unique elements. Hash Table Type
HashMap Data structure that provides an efficient way to store and retrieve key-value pairs. Hash Table Type
MaxBinaryHeap Data structure that provides an efficient way to retrieve the max value. Heap Type
MinBinaryHeap Data structure that provides an efficient way to retrieve the min value. Heap Type
Cache A simple fixed-capacity in-memory key-value store. Cache Type
LRUCache Implementation of a Least Recently Used (LRU) Cache with a fixed capacity. Cache Type
LFUCache Implementation of a Least Frequently Used (LFU) Cache with a fixed capacity. Cache Type

Linked List Type


Linked List

A linked list is a linear data structure where elements, called nodes, are stored in a sequence. Each node contains two parts: the data and a reference (or pointer) to the next node in the sequence. The first node is called the head, and the last node points to null, indicating the end of the list.

Instantiation

  • Empty: linked_list = LinkedList()

  • From a Sequence type object: linked_list = LinkedList(SEQUENCE_TYPE_OBJ)

Attributes

  • head: the head of the linked list (ListNode type)
  • tail: the tail of the linked list (ListNode type)

Methods

Method Description Time Complexity
prepend Add a new node with the given value at the begin of the linked list. O(1)
append Add a new node with the given value at the end of the linked list. O(1)
insert Insert a new node at the index-th with the given value. O(n)
get Return the index-th node in the linked list, if the index is valid. O(n)
remove Remove the index-th node in the linked list, if the index is valid. O(n)
is_empty Return True if the linked list is empty. Otherwise, False. O(1)

Double Linked List

A doubly linked list is a more complex version of a linked list where each node has two references: one pointing to the next node and another pointing to the previous node. This bidirectional structure allows traversal in both directions (forward and backward). Like a regular linked list, the first node is the head, and the last node is the tail.

Instantiation

  • Empty: double_linked_list = DoubleLinkedList()

  • From a Sequence type object: double_linked_list = DoubleLinkedList(SEQUENCE_TYPE_OBJ)

Attributes

  • head: the head of the linked list (DoubleListNode type)
  • tail: the tail of the linked list (DoubleListNode type)

Methods

Method Description Time Complexity
prepend Add a new node with the given value at the begin of the double linked list. O(1)
append Add a new node with the given value at the end of the double linked list. O(1)
insert Insert a new node at the index-th with the given value. O(n)
get Return the index-th node in the double linked list, if the index is valid. O(n)
remove Remove the index-th node in the double linked list, if the index is valid. O(n)
is_empty Return True if the double linked list is empty. Otherwise, False. O(1)

Stack Type


Stack

A stack is a linear data structure that follows the Last In, First Out (LIFO) principle. This means that the last element added to the stack is the first one to be removed.

Instantiation

  • Empty: stack = Stack()

  • From a Sequence type object: stack = Stack(SEQUENCE_TYPE_OBJ)

Methods

Method Description Time Complexity
pop Delete and return the last element added to the stack. O(1)
push Push the element value at the top of the stack. O(1)
peek Return the last element added to the stack. O(1)
is_empty Return True if the stack is empty. Otherwise, False. O(1)

MinMaxStack

A regular stack where you can access the minimum and maximum of the elements presents in the stack in constant time.

Instantiation

  • Empty: stack = MinMaxStack()

  • From a Sequence type object: stack = MinMaxStack(SEQUENCE_TYPE_OBJ)

Methods

Method Description Time Complexity
pop Delete and return the last element added to the stack. O(1)
push Push the element value at the top of the stack. O(1)
peek Return the last element added to the stack. O(1)
min Return the min element presents in the stack. O(1)
max Return the max element presents in the stack. O(1)
is_empty Return True if the stack is empty. Otherwise, False. O(1)

Queue Type


Queue

A queue is a linear data structure that follows the First In, First Out (FIFO) principle. This means the first element added to the queue is the first one to be removed.

Instantiation

  • Empty: queue = Queue()

  • From a Sequence type object: queue = Queue(SEQUENCE_TYPE_OBJ)

Methods

Method Description Time Complexity
enqueue Push the element value at the end of the queue. O(1)
dequeue Delete and return the first element of the queue. O(1)
peek Return the first element of the queue. O(1)
is_empty Return True if the queue is empty. Otherwise, False. O(1)

Max Priority Queue

A max priority queue is a data structure that maintains a collection of elements where each element has an associated priority value. In a max priority queue, elements with higher priority values are dequeued (removed) before elements with lower priority values.

Note that in case of same priority, the FIFO principle is applied.

Instantiation

  • Empty: max_priority_queue = MaxPriorityQueue()

  • From a sequence of priorities and values: max_priority_queue = MaxPriorityQueue(SEQUENCE_TYPE_OBJ)

Methods

Method Description Time Complexity
enqueue Add value to the queue with given priority. O(log(n))
dequeue Remove and return highest-priority value. O(1)
peek Return highest-priority value without removing it. O(1)

Min Priority Queue

A min priority queue is a data structure that maintains a collection of elements where each element has an associated priority value. In a min priority queue, elements with lower priority values are dequeued (removed) before elements with higher priority values.

Note that in case of same priority, the FIFO principle is applied.

Instantiation

  • Empty: min_priority_queue = MinPriorityQueue()

  • From a sequence of priorities and values: min_priority_queue = MinPriorityQueue(SEQUENCE_TYPE_OBJ)

Methods

Method Description Time Complexity
enqueue Add value to the queue with given priority. O(log(n))
dequeue Remove and return lowest-priority value. O(1)
peek Return lowest-priority value without removing it. O(1)

Tree Type


A tree is a data structure used to represent hierarchical relationships between elements. It consists of nodes connected by edges, and it follows a specific organization that resembles a tree in nature.

Binary Tree

A binary tree is nothing but a tree where each node has at maximum 2 children.

Instantiation

  • Empty: binary_tree = BinaryTree()

  • From a Sequence type object: binary_tree = BinaryTree(SEQUENCE_TYPE_OBJ)

Attributes

  • root: the root of the tree (BinaryTreeNode type)

Methods

Method Description Time Complexity
preorder_traversal Return the preorder traversal of the tree. O(n)
inorder_traversal Return the inorder traversal of the tree. O(n)
postorder_traversal Return the postorder traversal of the tree. O(n)
levels_traversal Return the level order of the tree. O(n)

Binary Search Tree

A Binary Search Tree (BST) is a Binary Tree with the following two properties for every node:

  • All values in the left subtree of the node are smaller than the node's value.

  • All values in the right subtree of the node are greater than the node's value.

It allows efficient searching, insertion, and deletion.

Instantiation

  • Empty: binary_search_tree = BinarySearchTree()

  • From a Sequence type object: binary_search_tree = BinaryTree(SEQUENCE_TYPE_OBJ)

Attributes

  • root: the root of the tree (BinaryTreeNode type)

Methods

Method Description Time Complexity
insert Insert a node into the binary search tree. O(log(n))
delete Delete a node from the binary search tree. O(log(n))
find Return the node with the given value if found else None. O(log(n))

Note: All methods of the class BinaryTree are also inherited by the class BinarySearchTree.

Trie Type


Trie

A Trie (pronounced "try") is a tree-like data structure used to store a dynamic set of strings, where each node represents a single character of a string. It is especially efficient for searching words, making it useful for applications like autocomplete, spell checking, and prefix-based searches.

Instantiation

  • Empty: trie = Trie()

  • From a Sequence type object of strings: trie = Trie(SEQUENCE_TYPE_OBJ)

Methods

Method Description Time Complexity
add Add a word in the trie. O(n)
search Search if a word is in the trie. O(n)
startswith Search if a word in the trie start with the given prefix. O(n)
remove Remove the given word from the trie. O(n)

Match Trie

A regular trie, where you can search words with matching.

Base match character is '.', but you can change it with the parameter match.

Instantiation

  • Empty: trie = MatchTrie()

  • From a Sequence type object of strings: trie = MatchTrie(SEQUENCE_TYPE_OBJ)

Methods

Method Description Time Complexity
add Add a word in the match trie. O(n)
search Search if a word is in match the trie. O(n)
startswith Search if a word in the match trie start with the given prefix. O(n)
remove Remove the given word from the trie. O(n)

Hash Table Type


Hash Set

A hash set is a data structure, implemented with a hash table, that stores a collection of unique elements, offering efficient operations such as insertion, deletion, and lookup.

Instantiation

  • Empty: hash_set = HashSet()

  • From a Sequence type object of strings: hash_set = HashSet(SEQUENCE_TYPE_OBJ)

Methods

Method Description Time Complexity
add Add an item into the HashSet. O(1)
remove Remove an item from the HashSet. O(1)
contains Check if an item is in the HashSet. O(1)
clear Clear the HashSet. O(n)

Hash Map

A hash map is a data structure that provides an efficient way to store and retrieve key-value pairs.

Instantiation

  • Empty: hash_map = HashMap()

Methods

Method Description Time Complexity
get Get the value of the given key. O(1)
add Add an item, a couple (key, value), into the HashMap. O(1)
remove Remove a key, and the respective value, from the HashSet. O(1)
keys Retrieve the keys of the Hash Map as generator. O(n)
values Retrieve the values of the Hash Map as generator. O(n)
items Retrieve the pairs (key, value) of the Hash Map as generator. O(n)
clear Clear the HashMap. O(n)

Heap Type

A heap is a specialized binary tree-based data structure that satisfies the heap property


Max Binary Heap

A max binary heap is a heap data structure that takes the form of a binary tree. The largest element is always at the root of the tree.

Instantiation

  • Empty: max_heap = MaxBinaryHeap()

  • From a Sequence type object of strings: max_heap = MaxBinaryHeap(SEQUENCE_TYPE_OBJECT)

Methods

Method Description Time Complexity
insert Insert the element value into the max heap. O(log(n))
pop Remove and return the largest element from the heap. O(1)
max Return the largest element without removing it. O(1)

Min Binary Heap

A max binary heap is a heap data structure that takes the form of a binary tree. The smaller element is always at the root of the tree.

Instantiation

  • Empty: min_heap = MinBinaryHeap()

  • From a Sequence type object of strings: min_heap = MinBinaryHeap(SEQUENCE_TYPE_OBJECT)

Methods

Method Description Time Complexity
insert Insert the element value into the min heap. O(log(n))
pop Remove and return the smaller element from the heap. O(1)
min Return the smaller element without removing it. O(1)

Cache Type

A cache is a temporary storage layer that holds frequently accessed data to improve performance and reduce latency. It stores data closer to the user or system, avoiding repeated retrieval from slower storage or computations.

Note: Every implemented cache strategy can also be used as a decorator. See the end of this section.

Cache

A simple fixed-capacity in-memory key-value store.

Instantiation

  • Instantiation with given capacity (default is 1024): cache = Cache(capacity=POSITIVE_INTEGER)

Methods

Method Description Time Complexity
get_capacity Return the capacity of the cache. O(1)
get_cache Return the cache in the order the elements were added. O(1)
get Get an element from the cache. O(1)
put Put an element into the cache if capacity is not exceeded. O(1)

Least Recently Used Cache

Implementation of a Least Recently Used (LRU) Cache with a fixed capacity.

Instantiation

  • Instantiation with given capacity (default is 1024): lru_cache = LRUCache(capacity=POSITIVE_INTEGER)

Methods

Method Description Time Complexity
get_capacity Return the capacity of the cache. O(1)
get_cache Return the cache in the order of usage. O(1)
get Get an element from the cache. O(1)
put Put an element into the cache, eventually evict least recent used element. O(1)

Least Frequently Used Cache

Implementation of a Least Frequently Used (LFU) Cache with a fixed capacity.

Instantiation

  • Instantiation with given capacity (default is 1024): lfu_cache = LFUCache(capacity=POSITIVE_INTEGER)

Methods

Method Description Time Complexity
get_capacity Return the capacity of the cache. O(1)
get_cache Return the cache. O(1)
get Get an element from the cache. O(1)
put Put an element into the cache, eventually evict least frequently used element. O(1)

Decorators

Every above implemented cache can also be used as a decorator. Here, a small example.

from ds import lfu_cache

@lfu_cache(capacity=2)
def multiply(a, b):
    print(f"Calculating {a} * {b}")
    return a * b

print(multiply(2, 3))  # Calculating 2 * 3 -> 6
print(multiply(2, 3))  # Cache hit -> 6
print(multiply(3, 4))  # Calculating 3 * 4 -> 12
print(multiply(5, 6))  # Calculating 5 * 6 -> 30 (evicts least frequently used: 3 * 4)
print(multiply(2, 3))  # Cache hit -> 6
print(multiply(3, 4))  # Calculating 3 * 4 -> 12 (recomputed, since it was evicted)

About

A collection of data structure for Python

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages