Gegevens mining? And how can I perform it on my webstek? Stack Overflow
I’m preparing my graduation project from laptop science, I made this webstek and it’s running flawlessly but my supervisor requested mij to apply gegevens mining on the webstek. But I don’t understand what I should do. The webstek is a social network, each user will have a profile and blog and access to some e-books that required you to be registered so you can download. The webstek also contains a music server that contains songs that a registered user can choose a song to download or to add it spil a dearest ter his profile pagina, the webstek contains ads (I used OpenX script), so this is most of the webstek services where I can perform gegevens mining, the webstek is www.sy-stu.com.
I need ideas and what is the best way to present it ter the vraaggesprek?
You can ask your professor what wasgoed his intention of using gegevens mining. Gegevens mining algorithms can do various tasks, you need very first define what you want to accomplish and then find some algorithms for this and technical possibilities.
Some ideas that came to my mind about usage of gegevens mining te your project:
- you can use gegevens mining to find what songs (ebooks,etc.) can be favorited by a user based on other people favorites songs (find similarities, very likely association rules would be a good algorithm for this).
- you can use some clustering algorithms to group users based on some parameters and suggest them that they could become a connections with other people from the same group (if you have something like this)
Firstly, ask for clarification from your supervisor. Don’t say ‘What do you mean?’, but ask ‘Are you expecting something like this?’ because it shows that you’ve at least thought about it.
If you can’t think of anything, or your supervisor is vague, perform some elementary gegevens retrieval and analysis, e.g.
- most active members
- the most / least popular songs and books.
- number of ads clicked etc
- most popular webstek features
Just elementary analysis should suffice – you aren’t doing a statistics degree. Work out the most songs downloaded ter a day or vanaf user, the average songs vanaf user, how many users visit each day and how many sign up and never visit.
The purpose is to demostrate that your webstek is logging all activity, so that when you are asked ‘how many books did the 20 most active users download te June’ you will be able to work out the response.
The alternative is a webstek that just runs and you don’t have any skill of how your users are behaving and what they are doing, which means you aren’t able to concentrate on things that they find significant.