Share this
Domain Classification—A Look Down the Rabbit Hole
by Gregg Jones on Aug 16, 2022 12:00:00 AM
You google a site you need for work and come across DNSFilter's block page. Ever wonder what happens when you hit the report button in our software or notify the IT department about how you definitely need this website “scientifically ranking cute puppies" to be accessible?
First off, I agree with you, it is a travesty that it is blocked in the first place. Allow me to take you on a journey behind the curtain for a second to get a peek at the wide and wild world of domain categorization and report processing.
Common Categorization Complaints
First there's a rather common report. This involves the good ol' Parked & Under Construction category. Oft victims of not being built soon enough with enough significant content for when they get queued up for analysis, these are some road-less-traveled sites.
I can hear your thought process now, dear reader: "You mean there aren't millions of visitors to Uncle Jim's Clown Emporium with one picture of him crying while making balloon animals?!?"
Correct. When there isn't enough content, or if it misses the timing of its initial scan, the AI will look at it and say, "Hey looks like this one is still in the oven," and sticks it into Parked & Construction.
A Human Touch to Domain Categorization
This is where the Domain Intelligence team comes in. "Oh this is definitely Art & Entertainment." Though we may be stretching the definition of Art in Uncle Jim's case (does this even qualify as a business?) Oh well, yes technically, he has a link for tips in the very bottom left corner of an infinite scroll.
Joking aside, there are many factors when we take into account categorization of a domain. Overall a stout ruling of "empirical evidence is king" when looking to fit things into categories is the way to go. This involves an amount of research that would surprise you and a meticulous evaluation of every site that comes across the desk.
"But how hard is it to come to Arts/Business and send it on its way?"
Well that varies, to be honest, and some sites are easier than others. There are other research factors as well, especially when evaluating threats such as malware and phishing—things to consider like history, overall health of the domain, security flaws and more. Luckily, we have the luxury of being able to have multiple categories across sites and a large repository of data to consider when evaluating sites.
Domain Categorization Isn’t Cut and Dry
For a prime and tangible day-to-day example you can take YouTube: It takes content all over the spectrum from Sports to Games to Music to Education and Self Help, to Business talks, to Tech etc etc etc. The category list could get staggering very quickly. In this case we can distill down YouTube into a broader Entertainment Category, since its primary goal is to entertain. Something that is a bit more specific, like Twitch.tv that caters more to the gamer niche, would be Games & Entertainment.
This is a way zoomed out look at the rabbit hole you can go down when dissecting content and dealing with even basic categorizations. This can get even zanier the more extensive the content and the more complex the website. So as you hit the report button or forward onto IT the Clown Emporium to get unblocked, consider what you would paint a site as.
It's always an interesting time to process these reports! Between all of the variations and combinations, it's well worth it to make sure the content our customers’ experience is well managed, well labeled and making the internet overall a safer place.
That's all from the Domain Intelligence Desk today! Have questions you want answered? Tweet us @dnsfilter!
Share this
Categories
- Featured (264)
- Protective DNS (21)
- IT (15)
- IndyCar (9)
- Content Filtering (8)
- Cybersecurity Brief (7)
- IT Challenges (7)
- Public Wi-Fi (7)
- AI (6)
- Deep Dive (6)
- Malware (4)
- Roaming Client (4)
- Team (4)
- Compare (3)
- MSP (3)
- Phishing (3)
- Tech (3)
- Anycast (2)
- Events (2)
- Machine Learning (2)
- Ransomware (2)
- Tech Stack (2)
- Secure Web Gateway (1)
Customer experience is the secret sauce that sets successful Managed Service Providers (MSPs) apart from the rest. In a market teeming with competition, you need to offer more than the best technology or the lowest prices. It's about how clients feel when they interact with your services. A stellar customer experience can transform a one-time client into a loyal advocate, while a poor one can send them running to your competitors. According to a ...
In July I published a blog on the DNSFilter website where I looked closely at our passive DNS data, highlighting early election trends in relation to threat domains.
The Children's Internet Protection Act (CIPA) is a critical law designed to ensure that students are protected from harmful online content. It requires schools and libraries to implement Internet safety measures, such as filtering and monitoring, to safeguard minors. Compliance with CIPA is essential for institutions seeking E-Rate program discounts for Internet access and internal connections.