Investigating IPTV Malware in the Wild

: Technologies providing copyright-infringing IPTV content are commonly used as an illegal alternative to legal IPTV subscriptions and services, as they usually have lower monetary costs and can be more convenient for users who follow content from diﬀerent sources. These infringing IPTV technologies may include websites, software, software add-ons, and physical set-top boxes. Due to the free or low cost of illegal IPTV technologies, illicit IPTV content providers will often resort to intrusive advertising, scams, and the distribution of malware to increase their revenue. We developed an automated solution for collecting and analysing malware from illegal IPTV technologies and used it to analyse a sample of illicit IPTV websites, application (app) stores, and software. Our results show that our IPTV Technologies Malware Analysis Framework (IITMAF) classiﬁed 32 of the 60 sample URLs tested as malicious compared to running the same test using publicly available online antivirus solutions, which only detected 23 of the 60 sample URLs as malicious. Moreover, the IITMAF also detected malicious URLs and ﬁles from 31 of the sample’s websites, one of which had reported ransomware behaviour.


Introduction
As many illicit content providers supply copyright-infringing IPTV content for free, they often rely on intrusive advertising, tracking, scams, and malware to make a profit.Illicit content providers also get paid by malware authors to infect their own illicit IPTV websites or software, which are frequently malware families that can generate indirect income with ease, such as cryptocurrency-mining malware [1,2].One study found that dozens of illicit IPTV websites contained "download now" adverts, redirecting users to landing pages with instructions for downloading malicious browser extensions [3].Another study found that for the Australian population, 99% of advertisements on illicit content-sharing websites were categorised as high risk, with 46% of these advertisements classified as malicious [4].Overall, this implies that using copyright-infringing IPTV technologies poses a significant risk to users' devices, as illicit content providers cannot be trusted and may use malware to increase their revenue.
Internet Protocol Television (IPTV) is a service that provides TV programmes and on-demand video content under the TCP/IP internet protocols [5].Most legal IPTV services supply video content as part of a TV licence, one-time purchase, or paid subscription, whereas free alternatives usually incorporate advertisements.While IPTV services each have many different genres of video content, users often need to subscribe to multiple services to view the specific films and TV programmes they want to watch.Due to the limited effort, convenience, and monetary costs of these legal IPTV services, many users instead choose to use copyright-infringing IPTV technologies to view video content illegally.A report by Sandvine estimates that "roughly 6% of all households in North America currently have a Kodi device configured to access unlicensed files and streams" [6].This implies that many IPTV users are willing to risk compromising their home network by installing potentially infected software to access unlicensed video content in addition to possible legal action for copyright infringement.
One of the reasons why infringing IPTV users are willing to risk legal action or exposure to malware and scams is that illegal IPTV services are usually either free or have low monetary costs in comparison to legal IPTV services [7].Additionally, applications and various software for accessing infringing content can be installed from unofficial software and application stores on physical set-top boxes, which appeals to users due to its ease of use, facilitating a "wide range of illicit content being available in one place, without the need for multiple subscriptions" [8].Because of the lower perceived costs, these users are willing to accept the risks of trusting and potentially disclosing credit card or personal information to illicit content providers who cannot be trusted and who could be frauds.
The addition of the malware threat to this criminal environment adds another dimension to criminal activities, creating a poly-criminal environment.This was observed in the technical report on the online investigation of IP crime [9], which found that in many cases, the financial gain in one criminal activity supports the other, thus creating a vicious many-folded criminal ecosystem.Another importance in illuminating the malware ecosystem behind illegal IPTV is that it will challenge the judicial process behind the illegal IPTV ecosystem by stressing the fact that the severity of crime increases once the malware is added into the mix.
Our research found that currently, there are insufficient forensic investigation studies and actionable intelligence collections available to effectively identify the risks and threats of malware from using illegal IPTV technologies [10].This is in part due to an inadequate number of studies and malware analyses that have assessed the likelihood and severity of malware threats in the illegal IPTV ecosystem.An exception is the research that has been conducted for malicious PDF files and fileless threats in the form of malicious JavaScript codes [11].Features from the embedded PDF files are extracted and utilised for classification.The differentiation of our study with this study is that we have provided an investigation framework, and our classification is based on cyber threat intelligence sources, which depends on the collective work of cyber threat analysts.The novelty of the framework comes within the niche field of IPTV and the recursive collection and query methodology rather than through signature generation over extracted features.Similar malicious URL detection literature can be found in the work of Aljabri et al. [12].
Moreover, malware threats are constantly changing, necessitating malware analyses to be completed consistently on an ongoing basis to provide timely threat intelligence about the risks of infringing IPTV technologies [13][14][15][16].Although there are a small number of indicators of compromise (IOCs) relating to illegal IPTV technologies that exist, such as the AlienVault Open Threat Exchange intelligence platforms, our research shows that there is minimal threat intelligence about the main types of malware used, their severity, and the prominent Tactics, Techniques, and Procedures (TTPs) used by threat actors to exploit the devices of infringing IPTV technology users.
However, in other areas of research, the prevalent malware families, their severity, and the likelihood of infection from using copyright-infringing technologies have been identified.For example, Bosco and Shalaginov, while not focusing specifically on infringing IPTV technology, identified the malware families found in technologies providing copyright-infringing digital content and actionable threat intelligence into the main malware threats from digital content piracy technologies, although these may be less relevant for threat intelligence researchers in the future due to the ever-changing nature of malware threats [17,18].
For example, a security report by Ponemon Institute found that fileless malware [19][20][21] a acks have been steadily increasing over time, while the number of zero-day a acks nearly doubled between 2019 and 2020 [22].Additionally, Bosco and Shalaginov found that when searching for popular films in a web browser to identify illicit IPTV websites, 20% of websites had been removed from the search results and replaced with new sites between two rounds of analysis, with 8% of the websites classified as malicious by Vi-rusTotal [17].These results suggest that both malware threats and the copyright-infringing IPTV ecosystem are changing continuously.This may be partially due to legal "whacka-mole" enforcement that removes illicit websites, coupled with the development of zeroday malware to exploit new vulnerabilities.Therefore, to address this issue, in this paper, we present a framework for analysing malware threats in illegal IPTV technologies, the illegal IPTV Technologies Malware Analysis Framework (IITMAF).
The IITMAF has been designed to identify new threats and risks and encourage ongoing evaluation of these.The framework consists of a methodology for securely detecting, collecting, and analysing malware from infringing IPTV technology in addition to identifying and assessing the risks of using these technologies.Moreover, the framework provides a software solution for automating the collection and analysis of malware found in illegal IPTV websites and software, with the focus of the automated software solution being the provision of actionable information.
Actionable information should be relevant, timely, accurate, complete, and ingestible [23].The IITMAF aims to meet these actionable information requirements for malware in infringing IPTV technologies by providing a solution that can quickly analyse identified URLs for illicit IPTV websites, app stores, and software files using static and dynamic analysis techniques and can consecutively generate a comprehensive report in a structured format.In addition to the identification of malicious URLs and malware analysis capabilities, the framework has also shown to be useful for generating cyber threat intelligence, TTPs, and similar a ack pa erns, as can be found in the evaluation section.The rest of the paper is structured as follows: Section 2 provides a comprehensive review of the relevant literature on cyber threat intelligence in the IPTV ecosystem and makes the necessary definitions.Section 3 outlines the high-level methodology, including the data collection and data flow diagram of the framework.Section 4 presents and discusses the implementation, the test case, and the evaluation.Section 5 critically evaluates the main findings and their implications, and Section 6 concludes the paper while presenting the limitations.

Background
Illegal IPTV technologies consist of physical set-top boxes, websites, or software in the form of standalone applications or illicit add-ons to legal software [7].Supplying these technologies for use without paying for the content they transmit is a crime.One study identified trojan, adware, spyware, and backdoor malware from content theft websites [17], implying that illicit IPTV providers may include malware in their websites and software or advertise malware disguised as a desirable application to increase their profits.Thus, as illicit IPTV providers are already commi ing a crime, it appears that, in addition, many of these providers elect to distribute malware as part of the delivery to supplement their income.The following sections will define the illegal IPTV technologies used and their risks in addition to outlining relevant malware collection and analysis techniques for these technologies.
Illicit Streaming Devices (ISDs) are physical boxes or USB sticks that connect to a TV to provide free television and film content that you would usually pay to view [8].Many physical IPTV boxes, such as Kodi boxes, are legal, but third-party software can be installed to illegally stream IPTV content for free.Conversely, other physical IPTV boxes, often described as "fully loaded" or "jailbroken", already include software for facilitating illegal IPTV streaming.Because ISD providers are already willing to commit copyright infringement, they are more likely to commit further breaches of law, as it could increase their profits.Therefore, ISD providers cannot be trusted, as they could supply users with ISDs infected with malware.
Overall, ISD providers are unlikely to infect the products they sell with high-impact malware when they are already making a profit from selling ISDs.However, potential users would likely purchase an ISD from a website, which itself could be a scam designed to try and get individuals to disclose their credit card details.Moreover, fake websites that advertise illegal IPTV boxes could also distribute fileless malware when visited by users.Therefore, there are other risks to purchasing and using ISDs, as the providers cannot be trusted and could be a empting to scam people.
Websites for streaming IPTV content illegally are available over the surface web, with examples including PutLocker and FlixTor.IPTV content is usually freely available on these websites, which is a ractive to users who are not willing to pay for a streaming service or risk purchasing an ISD.However, as the illicit content provided is often free, IPTV websites are untrustworthy, as they are more likely to rely on trackers, scams, and malware to gain a profitable income.
Illegal IPTV websites have different strategies for providing infringing IPTV content.Many sites host and potentially live-stream IPTV content on their website, although these websites are more likely to be detected by anti-piracy organisations.To reduce the risk of legal action, some websites collect and contain lists of hyperlinks to websites for accessing IPTV content illegally, known as "link aggregators" [9].Link aggregators can also be found on legitimate websites, such as GitHub repositories and forum posts.
Once more, using illegal IPTV websites or aggregators is risky, as they cannot be trusted.In comparison to ISDs, IPTV websites are potentially riskier because they receive no income for providing free IPTV content, whereas ISDs are purchased.Hence, IPTV websites rely on intrusive advertising and malware to gain a profit, with adverts often redirecting users to malicious or scam websites when clicked on [7].This is known as malvertising (malicious advertising), which distributes malware by injecting online advertisements with malicious code [24].Cybersecurity company RiskIQ found that 1 in 3 content theft websites expose visitors to malware, with hackers paying the providers USD 70 million to add malware to their websites [1].This implies there is a significant chance of users obtaining infected with high-severity malware, especially if hackers are willing to pay a total of USD 70 million.
Malware can be distributed to users of illegal IPTV websites when users download video files for watching IPTV content, such as MP4 files.When an infected file is opened, the malware will execute on the user's device, which could be anything from ransomware to a remote access trojan (RAT).Although users may not realise the risks of downloading files from an untrustworthy source, technically proficient users will be aware of the risks and are likely to mitigate the risk of infecting their devices by using an antivirus that scans files or a virtual machine (VM).
However, this is not the only risk of using IPTV websites.Another risk is fileless malware.Fileless malware does not require a user to download a malicious file; rather, it exploits vulnerable applications on a victim's device to enable the injection of malicious code into its main memory [25].Fileless malware is a high risk to users, as it is unlikely to be detected by antivirus signatures and could potentially infect a user as soon as they visit a website [26].
One study analysed the malicious codes in embedded PDFs.Moreover, malicious codes embedded into the PDF files present a prevalent way of infecting the main memory and using malicious JavaScript codes [11].There are several recent data-dependent malicious URL identification and classification studies in the literature based on machine learning, deep learning, or an ensemble of classification algorithms [12,[27][28][29][30][31].Furthermore, considering the methodology, our paper shows similarity in the collection of cyber threat intelligence with the works of Ghaleb et al. [30] and by the similarity of analysing the websites within a framework with the works of Rafsanjani et al. [31].However, our paper differentiates from the literature by providing an exemplar analysis for the niche domain of illicit IPTV websites' identification of malvertising and providing the tools and best practices that frame the investigation guideline.
Furthermore, many threat intelligence platforms, such as VirusTotal and AlienVault Open Threat Exchange (OTX), do not recognise these sites as malicious.To illustrate, we scanned 1555 illicit IPTV websites and aggregators gained from an IPTV GitHub repository in VirusTotal.Of these websites, only 34 were identified as malicious or had an association with malware for both VirusTotal and AlienVault OTX even though many of these websites contained intrusive advertisements a empting to scam users into downloading potentially unwanted programs (PUPs) that may have been malicious.
Infringing IPTV (Pro v7.0.6) software includes standalone desktop and mobile phone applications in addition to add-ons or plugins for legitimate IPTV software, such as Kodi.Using standalone applications to access IPTV content illegally often requires paid subscriptions.A study found that a business, SET TV, offered infringing IPTV content to over 180,000 users with a USD 20 monthly or USD 200 annual subscription via a standalone software application [7].Again, it is less likely that providers will infect IPTV applications with malware if they are already making a profit.In comparison to using websites for IPTV, there is more incentive for IPTV website providers to include malware, as the content provided is usually free.However, downloading and executing an application infected with malware could have a greater impact if users do not have antivirus software installed.
While studies suggest that standalone infringing IPTV applications have a considerable number of users, another study found that 26 million Kodi users (68% of the total user base) were pirating illegal IPTV content using Kodi (20.2) software add-ons [32].Although these add-ons are often downloaded from likely benign GitHub repositories, the Digital Citizens Alliance found that third-party Kodi add-ons were used to distribute cryptocurrency-mining malware [2].Similarly, Warrior et al. found that 1.4% of Kodi addons resolved to domain IP addresses found on malicious blacklists (131 out of 9146 addons studied) [10].This implies that add-ons facilitating the streaming of illicit IPTV content are more widely used than standalone applications and may be more likely to be infected with malware.

Implementation
Requirements for the framework were elicited through secondary research of the existing literature on the topics of malware forensics and the illegal IPTV ecosystem, which enabled the gathering of requirements for the collection and analysis of malware from illegal IPTV technologies.Exploratory research consisting of analysing a sample of 1555 illicit IPTV websites in AlienVault OTX and VirusTotal was also completed to obtain the requirements.The research results allowed the identification of the types of malwares in illicit IPTV websites, determining the approaches for detecting and collecting malware from these websites and establishing the methodology and best practices of the framework for collecting and analysing a dataset of illegal IPTV technologies.
Figures 1 and 2 present the overall methodology and a high-level data flow diagram, respectively.The software solution has a client-server structure with an API "dispatcher" server to handle requests from a "requester" client.This can enable multiple users with requester clients to issue requests to the dispatcher, providing scalability.The IITMAF consists of several components working together to support the investigations for the IPTV ecosystem.A web scraper was designed and implemented for the collection of malicious scripts and files.A Tor requester was implemented in the scraper so that the anonymity of the investigator was ensured.For the collected artefacts, both file-based and fileless static analysers were included in the framework since, as explained previously, malicious actions and objectives are sometimes provided within a shellcode/script format.In Figure 2 below, a detailed data flow diagram of how these modules interact with each other is given.
The development environment of the IITMAF's automated software solution consisted of a Windows 10 PC with Python 3.8.6 installed.Visual Studio Code was selected for the IDE, as it supports Python and integrates with GitHub.Moreover, Tor was installed and configured to run as a Windows service to enable the IITMAF software solution to make anonymous HTTP requests.
The dispatcher (see Figure 2) provides a REST API that enables the collection and analysis of malware from illegal IPTV technologies.It serves several API endpoints that are called by the requester client to submit URLs.While the API is currently configured to run locally, it can be configured to make the API accessible over the internet, enabling any cybersecurity professional to identify and analyse malware in the illegal IPTV ecosystem.Additionally, the API endpoints can be accessed using Curl, Postman, or any programming language that can initiate and handle HTTP endpoints, potentially allowing organisations, such as intellectual property offices, to use the IITMAF as a threat feed for providing cyber threat intelligence.
The dispatcher has four endpoints, consisting of endpoints for scraping URLs from a webpage; iteratively scraping URLs from a website; static analysis; and dynamic analysis.The requester client is used to submit the dataset of illicit IPTV URLs to the dispatcher API server sequentially and process the results to generate a report for each URL submitted.This allows multiple URLs to be submi ed to the API and be analysed without having to manually submit every URL.

Malware Collection and Analysis
For the malware analysis functions, file and webpage URLs are collected from a given webpage URL using a website scraper, which is then added to a list and queued for analysis, with the given webpage's URL and domain appended to the front of this list as shown in Figure 3.If a URL pointing to a non-webpage file is submi ed, the webpage scraper collects URLs and files from the URL path with the filename removed instead, which will include the original URL submi ed.
Once the URLs have been collected, they are queued for either file-based or fileless static analysis.If a URL contains a webpage file type or no file type, it is submi ed to static fileless analysis.Otherwise, URLs are submi ed to file-based static analysis with the files downloaded first.The URLs and downloaded files are then compared with either file-based malware or fileless malware YARA rules, identifying any binary or textual pa erns that indicate a file or webpage has malicious capabilities.Any YARA rule matches for a URL or file are then recorded and stored as a JSON object to be used in the generated report.When the static analysis process is complete, the collected URLs are submi ed to either file-based or fileless dynamic analysis, similar to the static analysis process.However, JavaScript (JS) files are submi ed to both file-based and fileless dynamic analysis, as JS files can contain malware, and hence, analysing JS files in different environments could provide more accurate results.
The dynamic analysis entails submi ing the collected URLs and files to an isolated sandbox environment for automated analysis using the Hybrid Analysis API (h ps://www.hybrid-analysis.com/(accessed 25 August 2023)) with a generated analysis report returned once the analysis is complete.

Report Generation
When all the collected files and URLs have been analysed, a single report is generated for each URL submi ed to the dataset text file.Each report contains a section for the URLs retrieved from the iterative URL collection function, static analysis results, and dynamic analysis report file paths.The report only contains the file paths of the Hybrid Analysis reports generated from the dynamic analysis or else the report would be too large.Figure 4 displays a sample of a generated analysis report and shows the dynamic analysis reports generated and the URLs scraped from the given URL webpage.

Evaluation
To evaluate the IITMAF and showcase the main features of the IITMAF's automated software solution, we trialled the software on an illicit IPTV website, zmovies.co,applying the automated methodology of the framework outlined in Section 3.1.
Before submi ing the URL of zmovies.co to the dataset and running the IITMAF software solution, we made some modifications to the solution configuration file.The configuration file enables the user to set limits to the collection and analysis to shorten the duration of a dataset's analysis or to increase the volume of information reported.We set the scraper depth to 3, the maximum number of URLs scraped to 300, and the dynamic analysis submission limit to 25, as shown in Figure 5. Once the configuration was complete, the dispatcher API and requester client of the IITMAF software solution were executed, initiating the iterative URL collection as shown in Figure 6.The iterative URL collection function extracted 87 unique URLs from the given website URL with any duplicate URLs removed from the list, while 20 URLs were extracted from the webpage of the given URL and queued for analysis.
For the analysis, any URLs pointing to files were downloaded, with some of the files downloaded shown in Figure 7. Once the static and dynamic analysis was complete, a report was generated, with a sample of the report presented in Figure 7.Although the submi ed URL and its scraped files and URLs returned no YARA rule matches, the iterative URL collection and dynamic analysis functions supplied a high number of results, with Figure 7 showing some of the URLs scraped from the given website and the Hybrid Analysis reports generated.
To determine if zmovies.co was associated with malware or malicious activity, we submi ed some of the internal and external URLs collected from the website to VirusTotal and checked the generated Hybrid Analysis reports for malicious indicators.While we found a few malicious indicators from the URLs collected, the Hybrid Analysis report of a file scraped from zmovies.co indicated malicious activity.
Figure 8 shows the Hybrid Analysis report for a JavaScript file downloaded from zmovies.co that marks the file as "suspicious" and classifies it as a Trojan HTML Agent.In conclusion, the IITMAF software solution was able to detect a potentially malicious file collected from an illicit IPTV website and classify it.This shows that the framework can collect URLs and files from illicit IPTV websites and identify malicious activity.

Findings
This experiment completed for evaluating the IITMAF entailed submi ing a research sample of illegal IPTV technology URLs to the IITMAF and comparing the analysis results with results from VirusTotal.The research sample had a total of 60 URLs consisting of 35 illicit IPTV website URLs, 13 app store URLs (from 7 app stores), and 12 standalone software executable and add-on URLs.Less illicit IPTV app store and software URLs were included in the research sample, as they were more difficult to locate in comparison to websites, which are more accessible using search engines.
For this evaluation (Section 3.3), each sample URL submi ed to the IITMAF collected up to 25 URLs, which were then submi ed for analysis in addition to the research sample URL.Once the analysis reports had been generated for the research sample, the number of URLs in the research sample identified as malicious was recorded and compared with the results of the research sample URLs submi ed to VirusTotal (Table 1).Moreover, the number of URLs in the research sample that scraped URLs or files identified as malicious by the IITMAF was recorded.These results were then compared with the number of URLs in the research sample that identified malicious-related files from 2022 for the domain of the URLs in VirusTotal.Only malicious files from 2022 were used to determine the number of malicious relations for a URL, as VirusTotal is a community-based threat intelligence platform that identifies malware previously submi ed that may not be relevant to URLs in 2022.Hence, only recent malware from 2022 was counted.
Table 1 presents the findings of the experiment completed to evaluate the IITMAF software solution, which was conducted from the 8th to the 16th of August 2022.As Vi-rusTotal identifies related malicious files with a URL domain, we used the domains of the URLs in the research sample to determine whether a URL had related malicious files.Therefore, the related malicious file results for app stores are out of seven, as the thirteen app store URLs in the research sample only include seven different domains.The experiment results show that the IITMAF solution detected more URLs as malicious than VirusTotal overall, with the IITMAF identifying 32 out of the 60 URLs of the research sample as malicious, while VirusTotal only identified 23 as malicious.Additionally, the IITMAF solution detected related malicious files or URLs for 31 of the research sample URLs, whereas VirusTotal identified 26 malicious relations from 2022.

Discussion
Although the IITMAF identified more illicit IPTV technologies as malicious than Vi-rusTotal, VirusTotal classified a greater number of illicit IPTV app store and software URLs as malicious than the IITMAF.This is because the app store and software URLs often pointed to Android APK files or had relations with these files, which the dynamic analysis feature of the IITMAF solution is unable to execute and analyse, as it uses a Windows sandbox environment by default.Conversely, the IITMAF was still able to identify 12 of the 25 app store and software URLs as suspicious, and thus, only 8 out of 25 were classified as benign.Moreover, the IITMAF identified ransomware TTPs for three files scraped from an illicit IPTV app store included in the research sample.This demonstrates that the IITMAF is still capable of providing actionable information for a range of illegal IPTV technologies even if it has limitations when analysing APK files.Finally, the IITMAF primarily focuses on the analysis of illicit IPTV websites and the extraction of files and URLs from them.Hence, due to time constraints, the dynamic analysis of Android files that require a different sandbox environment was not implemented.
Figure 9 shows some of these collected URLs for an illicit IPTV website (moviesjoy.to) that includes URLs with external domains potentially used for malvertising.The VirusTotal report for one of the external domains collected for this IPTV website, namely "static.zdassets.com",identified many recent malicious related files that had file names used in intrusive advertisements, which is common in illicit IPTV websites, such as "An-tiVirusExe.exe"and "PCCleaner" (h ps://www.virustotal.com/gui/domain/static.zdassets.com/relations(accessed on 25 August 2023)).Moreover, another URL to an external domain, bigcache.ml,had a URL path pointing to an alleged JQuery library JavaScript file.The VirusTotal report of this URL reported two malicious and one suspicious antivirus alert in addition to classifying the domain as a malware site.This implies that the iterative URL collection function is effective at collecting URLs from a website including any external domains that could be used for malvertising.Hence, the IITMAF is capable of iteratively scraping potentially malicious URLs from a website.Thus, this demonstrates that the webpage URL collection function is effective at collecting files and URLs from a webpage and submi ing them for analysis.Conversely, a limitation of both the URL collection functions is that website CAPTCHA security mechanisms are often triggered when scraping a website, limiting the number of URLs and files that are collected.The likelihood of a CAPTCHA mechanism triggering is increased further where the HTTP requests made by the URL collection functions are routed through Tor.While disabling Tor for the URL collection functions could enable more URLs to be collected, this would make the IITMAF host's public IP address visible to potentially malicious websites, posing a security risk.For this aim, if the Tor function needs to be disabled, it is required to limit the functionality to URL collection.The results of this experiment suggest that the URL collection functions are sufficient even with this limitation in place.
Table 2 shows the correlation from CTI sources collected for all the IPTV websites investigated in this research (Section 4).The  Thus, Table 2 shows that the signature and number of files that a webpage tries to download highly correlate with the visitor's signatures (value 0.94).Further, a correlation of ~0.67 was observed between the number of fileless signatures and the extracted URLs.This la er correlation can be explained in the form of the delivery of fileless threats, which is the general case through browser exploitations.
For the a ribution of malicious threats, the contemporary approach is the identification of the techniques and tactics.This, in return, reveals the modus operandi of the attacker and renders the a ribution possible [33].Therefore, within our experiment, we have also collected the Techniques, Tactics, and Procedures of the MITRE ATT and CK framework and presented below the most common techniques that have been utilised on these IPTV sources: The breakdown of the use of these techniques can be found in Figure 10.

Conclusions
This paper presented the IITMAF, an effective tool for automatically detecting, collecting, and analysing malware from illegal IPTV technologies.We evaluated the framework through a series of use cases and collections where we demonstrated the effectiveness of the IITMAF and showcased the results that can be achieved using the IITMAF when compared to existing solutions.The evaluation results show that the is more effective at detecting malicious capabilities and behaviour from illicit IPTV technologies overall, while VirusTotal is more effective at detecting malware from illicit IPTV app stores and software files.
Although VirusTotal identified a greater volume of app store and software URLs as malicious, the IITMAF was intended primarily for analysing illicit IPTV websites, with its iterative URL collection results capable of identifying possible external malvertising domains for an illicit website.
Moreover, the IITMAF produces a more comprehensive analysis report than other solutions, with matched YARA (the pa ern-matching tool for malware) rules and dynamic analysis results classifying malware and supplying related TTPs.From the findings (Section 4), the YARA rules detected malicious capabilities in an executable file relating to privilege escalation and a malicious document format.Likewise, the behavioural analysis results of an HTML webpage classified the detected malicious behaviour as an indicator of ransomware.This demonstrates that the IITMAF can supply actionable threat intelligence for malware detected in the illicit IPTV ecosystem.Most common Techniques used by IPTV URLs.

Limitations and Future Work
While the experiment results imply that the IITMAF is effective at collecting and analysing malware from illegal IPTV technologies, it does have limitations.One limitation of the IITMAF software is that the time to complete the analysis is high, as multiple URLs and files are submi ed to dynamic analysis, which can take time.In addition to this, the iterative approach employed in the methodology extends the analysis time even further.However, dynamic analysis and URL collection limits can be set to shorten this duration, providing configurability for the IITMAF.The depth of the recursion can be set using the internal parameters of the IITMAF.For this reason, per comparison, we have focused on the correct identification of the URL and IPTV resource.
Another limitation of the framework is that it provides limited analysis results for archive and Android-specific files, as they cannot be executed in the default Windows sandbox.Despite any limitations, the IITMAF focuses chiefly on the analysis of illicit IPTV websites, but we plan to address these limitations in the future work section.Additionally, our solution presents a guideline for the digital investigation of malicious files for the illegal IPTV domain rather than constituting a real-time application for stopping users from installing these applications, as the installation and access in most cases are conducted by the user themself without understanding/knowing the malicious intent behind.For the real-time application of the IITMAF, the signatures that are generated with the in-depth URL analysis would include the signatures that are branching from the suspicious/malicious URL itself.These in turn can be fed into IPS/IDS systems or Unified Threat Management (UTM) systems to cover a wider area of the neighbouring URLs where, without the analysis provided by the IITMAF, such systems can only block the front-facing URL content and the dynamically loaded content, and redirections would be omi ed.Similarly, without the IITMAF, this analysis on the neighbouring URLs and the dynamic content would have to be performed by the analyst otherwise.
Overall, the IITMAF provides a successful solution to the defined problem, providing cybersecurity professionals with both manual and automatic methodologies for identifying malware threats in the illicit IPTV ecosystem in addition to an automated solution for detecting, collecting, and analysing malware from illegal IPTV technologies.
The IITMAF primarily aims to analyse illicit IPTV websites with its depth-first URL collection results.This functionality reveals possible external malvertising domains for an illicit website, including the loaded dynamic content.Although the URL and file analysis functionality meets the core requirements and detected a greater number of URLs as malicious than VirusTotal in this evaluation experiment, it has two minor limitations.One of these limitations is that for the submi ed research sample in this evaluation experiment, only 4 out of the 60 URLs analysed had YARA rule matches indicating malicious capabilities.Nonetheless, 31 of the URLs had YARA rule matches for base64 encoded functions, implying that the static analysis functionality works properly.Additionally, one of the URLs pointing to an executable file had multiple YARA matches suggesting that it was malicious, as shown in Figure 11.This indicates that the signature-based malware detection functionality for IITMAF using YARA rules is still relevant and can detect malicious capabilities.The other limitation of the IITMAF analysis functionality is that it was unable to conduct dynamic analysis for ZIP files, as Hybrid Analysis does not automatically extract files from them.This suggests that the IITMAF has limitations when analysing illicit IPTV addons, as they are frequently distributed using ZIP files.
In summary, the automated URL and file analysis functions of the IITMAF are effective at detecting malware and supplying detailed information about a file or URL's capabilities and behaviour.While there are some minor limitations, the analysis functionality meets the key requirements set and passes all the major unit tests.
In the future, we intend to improve the IITMAF by implementing additional features to the automated solution.One of these features would be downloading files from scraped webpage URLs depending on if a URL HTTP request header's content type relates to a non-webpage file.This would ensure that non-webpage files referenced by URLs that do not identify a file extension in the URL path are downloaded.Additional functionality to be developed would be mechanisms to bypass website CAPTCHAs, providing more accurate analysis results of malicious activity on illicit IPTV websites.Moreover, we plan to implement a Hybrid Analysis interface function for performing dynamic analysis on Android APK files, as many illicit IPTV technologies consist of Android applications that cannot be dynamically analysed in a standard Windows-based sandbox.Finally, as the Hybrid Analysis API is unable to automatically unpack archive files, we aim to implement functionality that automatically unpacks downloaded archive files and submits the contents for analysis, as illicit IPTV software add-ons can be distributed in an archive format.

Figure 1 .
Figure 1.Flow chart of the framework's automated malware collection and analysis methodology.

Figure 2 .
Figure 2. DFD design of the IITMAF software solution.

Figure 3 .
Figure 3. Dispatcher console showing files being downloaded from an illicit IPTV website.

Figure 5 .
Figure 5. Configuration file of the IITMAF software solution.

Figure 6 .
Figure 6.Dispatcher console showing the iterative URL collection functionality.

Figure 7 .
Figure 7. Sample analysis report showing dynamic analysis and iterative URL collection results.

Figure 8 . 4 h
Figure 8. Sample of a Hybrid Analysis report showing the analysis results for a scraped file.

Figure 9 .
Figure 9. IITMAF report showing collected URLs for one of the illicit webpages.

Figure 10 .
Figure 10.Pie chart for the most common ATT and CK techniques observed.

Figure 11 .
Figure 11.IITMAF report showing matched YARA rules for a URL.

Table 1 .
Experiment results of malicious illicit IPTV URLs and files.

Table 2 .
table includes details of the following fields: Number of Files: the number of files that are extracted from a webpage. Number of Compromised Hosts: from all the URLs that are linking out from this webpage, the number of them that are marked malicious. Maximum Threat Score: a metric calculated by Hybrid Analysis (Hybrid Analysis 2022) demonstrating the threat score of a link. Malicious File and Fileless Signatures: the number of alerts raised by the crafted YARA rules analysis for file malware or fileless malware signatures, respectively. Extracted URLs: the number of URLs that are extracted from a webpage.Correlation matrix of the investigated IPTV webpages.